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PACKAGING t'lil.I. [.INKS H)R HI V-OHRIViiD RETROVIRAL VICTOR RARTICI 



BACKGROUND OF THE INVENTION 

Retroviral vectors based on lentiviruses. such as human immunodeficiency 
viruses (HIV), can infect nondividing ceils, and integration of proviral DNA occurs 
5 without the need for cell division. These properties make lentiviruses attractive for gene 
transfer into nondividing cells, such as hepatocytes. myofibers. hematopoietic stem 
ceils, and neurons. 

However, the use of lentivinis vectors, particularly HIV vectors, particularly for 
eene therapv, is hampered by concern over their safety. Thus, a need for the 
10 development of lentivinis vectors, particularly HIV vectors, with improved safety, 
particularly for gene therapy, exists. 

SUMMARY OF THE INVENTION 

The present invention relates to novel packaging cell lines useful for generating 
viral accessory protein independent lentivirus-denved. particularly HIV-derived. 

15 retroviral vector particles, to construction of such cell lines and to methods of using the 
accessory protein independent lentivirus-denved retroviral vector panicles to introduce 
DNA of interest into cells (e.g. eukaryotic cells such as animal (particularly 
mammalian), plant or yeast cells or prokaryotic cells such as bacterial cells). In a 
preferred embodiment, the packaging cell lines of the present invention are stable 

20 packaging cell lines. 

In one embodiment of the invention, packaging cell lines for producing a viral 
accessory protein independent lentivirus-denved retroviral vector particles comprise (a) 
a cell (e.g., mammalian cell); and (b) a retroviral nucleotide sequence in the cell which 
comprises a coding sequence for lentivinis gagpol. wherein said coding sequence has 
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been eodon optimized by mutagenisis to improve expression of the lentivirus gagpol 
proteins. 

In second embodiment of the invention, packaging cell lines tor producing a 
viral accessory protein independent lentivirus-derived retroviral vector particles 
5 comprise (a) a cell (e.g., mammalian cell); (b) a first retroviral nucleotide sequence in 
the cell which comprises a coding sequence for lentivirus gagpol, wherein said coding 
sequence has been codon optimized by mutagenisis. to improve expression of the 
lentivirus gagpol proteins; and (c) a second retroviral nucleotide sequence in the cell 
which comprises the coding sequence for a heterologous envelope protein. 

10 In a third embodiment of the invention, packaging cell lines for producing a viral 

accessory protein independent lentivirus-derived retroviral vector particles comprise (a) 
a cell (e.g.. mammalian cell); (b) a first retroviral nucleotide sequence in the cell which 
comprises a coding sequence for lentivirus gagpol. wherein said coding sequence has 
been codon optimized by nxitagemsis to improve expression of the lentivirus gagpol 

15 proteins; (c) a second retroviral nucleotide sequence in the cell which comprises the 
coding sequence for a heterologous envelope protein; and (d) a third retroviral 
nucleotide sequence which comprises a DNA sequence of interest and lentivirus ex- 
acting sequences required for packaging, reverse transcription and integration. 

In a fourth embodiment of the invention, packaging cell lines for producing a 

20 viral accessory protein independent lentivirus-derived retroviral vector particles 

comprise (a) a cell (e.g., mammalian cell); (b) a retroviral nucleotide sequence in the 
cell which comprises a coding sequence for lentivirus gagpol, wherein said coding 
sequence has been codon optimized by mutagenisis to improve expression of the 
lentivirus gagpol proteins; and (c) a retroviral nucleotide sequence which comprises a 

25 DNA sequence of interest and lentivirus cis-acting sequences required for packaging, 
reverse transcription and integration. 

In a fifth embodiment of the invention, packaging cell lines for producing a viral 
accessory protein independent HIV-derived retroviral vector particles comprise (a ) a cell 
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(e.g., mammalian cell); and (b) a retroviral nucleotide .sequence in the cell which 
comprises a coding sequence ibr HIV gagpoL wherein said coding sequence has been 
codon optimized by mutagemsis to improve expression of the HIV gagpol proteins. 

In sixth embodiment of the invention, packaging cell lines lor producing a viral 
5 accessory protein independent HlV-derived retroviral vector particles comprise (aj a cell 
(e.g.. mammalian cell); (b) a first retroviral nucleotide sequence in the cell which 
comprises a coding sequence for HIV gagpol. wherein said coding sequence has been 
codon optimized by mutagenisis to improve expression of the HIV gagpol proteins; and 
(c) a second retroviral nucleotide sequence in the cell which comprises the coding 

1 0 sequence for a heterologous envelope protein. 

In a seventh embodiment of the invention, packaging cell lines for producing a 
viral accessor)' protein independent HIV-derived retroviral vector particles comprise (a) 
a cell (e.g., mammalian cell); (b) a first retroviral nucleotide sequence in the ceil which 
comprises a coding sequence for HIV gagpol, wherein said coding sequence has been 

15 codon optimized by mutagenisis to improve expression of the HIV gagpol proteins; (c) 
a second retroviral nucleotide sequence in the cell which comprises the coding sequence 
for a heterologous envelope protein; and (d) a third retroviral nucleotide sequence which 
comprises a DN A sequence of interest and HIV cis-acting sequences required for 
packaging, reverse transcription and integration. 

20 In a eighth embodiment of the invention, packaging cell lines for producing a 

viral accessor)' protein independent HIV-derived retroviral vector panicles comprise (a) 
a cell (e.g.. mammalian cell); (b) a retroviral nucleotide sequence m the cell which 
comprises a coding sequence for HIV gagpol, wherein said coding sequence has been 
codon optimized by mutagenisis to improve expression of the HIV gagpol proteins; and 

25 (c) a retroviral nucleotide sequence which comprises a DNA sequence of interest and 
HIV cis-acting sequences required for packaging, reverse transcription and integration. 

Alternatively, each of the packaging cell lines described herein can be produced 
using (1) a retroviral nucleotide sequence which comprises a codon optimized gag 
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coding sequence and (2) a retroviral nucleotide sequence which comprises a codon 
optimized pol coding sequence, in place of the retroviral nucleotide sequence which 
comprises a codon optimized gagpol coding sequence. 

In a particular embodiment, the heterologous envelope protein is the G 
5 glycoprotein of vesicular stomatitis virus < YSY G). In another embodiment, the 

heterologous envelope protein is the amphotropic envelope of the Moloney leukemia 
virus (MLV). 

Cell lines for producing a viral accessory protein independent lentivirus-derived 
retroviral vector particles are produced by transfecting host cells (e.g.. mammalian host 
1 0 cells ) with a plasmid comprising; a DNA sequence which encodes Antivirus gagpol 
proteins, wherein said DNA sequence has been codon optimized by mutagenisis to 
improve expression of the lentivirus gagpol proteins. Depending upon the particular 
cell line being produced, the host cells are also co-transfected with a plasmid 
comprising a DNA sequence »which encodes a heterologous envelope protein, or a 

1 5 plasmid comprising a DNA sequence of interest and lentivirus cis-acting sequences 
required for packaging, reverse transcription and integration, or both of these plasmids. 
Alternatively, host cells are transfected with a plasmid comprising a codon optimized 
DNA sequence encoding a lentivirus gag protein and a plasmid comprising a codon 
optimized DNA sequence encoding a lentivirus pol protein, in place of the plasmid 

20 comprising a codon optimized DNA sequence encoding both lentivirus gagpol proteins. 
Cell lines for producing a viral accessory protein independent HIV-derived 
retroviral vector particles are produced by co-transfectmg host cells (e.g., mammalian 
host cells) with a plasmid comprising a DNA sequence which encodes HIV gagpol 
proteins, wherein said DNA sequence has been codon optimized by mutagenisis to 

25 improve expression of the HIV gagpol proteins. Depending upon the particular cell line 
being produced, the host cells are also co-transfected with a plasmid comprising a DNA 
sequence which encodes a heterologous envelope protein, or a plasmid comprising a 
DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse 
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transcription and integration, or both of these plasmids. Alternatively, host cells are 
transacted with a plasmid comprising a codon optimized DNA sequence encoding a 
IIIY gag protein and a plasmid comprising a codon optimized DNA sequence encoding 
a HIV pol protein, in place of the plasmid comprising a codon optimized DNA sequence 
5 encoding both HIV gagpol proteins. 

The present invention also relates to methods of producing viral accessory' 
protein independent lentivirus-derived retroviral vector particles, comprising co- 
transfecting host cells (e.g., mammalian host cells) with (a) a first plasmid comprising a 
DNA sequence which encodes lentivirus gagpol proteins, wherein said DNA sequence 

10 has been codon optimized by mutagenisis to improve expression of the lentivirus gagpol 
proteins; (b) a second plasmid comprising a DNA sequence which encodes a 
heterologous envelope protein; and (c) a third plasmid comprising a DNA sequence of 
interest and lentivirus cis-acting sequences required for packaging, reverse transcription 
and integration. Alternatively, host cells are transfected with a plasmid comprising a 

15 codon optimized DNA sequence encoding a lentivirus gag protein and a plasmid 

comprising a codon optimized DNA sequence encoding a lentivirus pol protein, in place 
of the first plasmid comprising a codon optimized DNA sequence encoding both 
lentivirus gagpol proteins. 

In a particular embodiment, the invention relates to methods of producing viral 

20 accessory protein independent HIV-derived retroviral vector particles, comprising co- 
transfecting host cells (e.g., mammalian host cells) with (a) a first plasmid comprising a 
DNA sequence which encodes HIV gagpol proteins, wherein said DNA sequence has 
been codon optimized by mutagenisis to improve expression of the HIV gagpol 
proteins; (b) a second plasmid comprising a DNA sequence which encodes a 

25 heterologous envelope protein; and (c) a third plasmid comprising a DNA sequence of 
interest and HIV cis-acting sequences required for packaging, reverse transcription and 
integration. Alternatively, host cells are transfected with a plasmid comprising a codon 
optimized DNA sequence encoding a HIV gag protein and a plasmid comprising a 
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codon optimized DNA sequence encoding a HIV poi protein, in place of the first 
plasmid comprising a codon optimized DNA sequence encoding both HIV gagpoi 
proteins. 

The present invention also relates to viral accessory protein-independent 
5 retroviral particles produced by or obtainable by (obtained by) the methods described 
herein. 

The present invention further relates to isolated DNA encoding a codon 
optimized lenti virus gagpoi isolated DNA encoding the gag coding region of a codon 
optimized lenti virus gagpoi and isolated DNA encoding the pal coding region of a 

10 codon optimized lenti virus gagpoi. In a particular embodiment, the present invention 
relates to isolated DNA encoding a codon optimized HIV gagpoi isolated DNA 
encoding the gag coding region of a codon optimized HIV gagpoi and isolated DNA 
encoding the/w/ coding region of a codon optimized HIV gagpoi 

The packaging celNines and viral particles of the present invention can be used 

1 5 for gene therapy or gene replacement with improved safety. The packaging cell lines 
and viral particles of the present invention can also be used in development and 
production of vaccines, and in production of biochemical reagents. Gene therapy 
vectors produced with the cell lines of the present invention are expected to be valuable 
medical therapeutics. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic diagram of an expression cassette containing the codon 
optimized gagpoi genes. The DNA was constructed in multiple segments, which are 
indicated at the top as 1/3, 2/3, 3/3 (A, B, C and D) and HIN. Restriction sites used to 
assemble the cloned segments are indicated above the kilobasepair (Kb) ruler. Below 

25 the ruler are multiple features showing the location of the human cytomegalovirus 
(CMV) promoter, human betaglobin sequences (Bglobin) ? mRNA sequences (thinner 
line represents intronic sequence), the gag and pol open reading frames, the individual 
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proteolytic fragment coding sequences ipl? MA. p24_C \. p7. p6. PR. p51_RT. 
RNaseH and integrase (IN)) and each synthetic oligonucleotide used in the assembly 
process (multiple adjacent open arrows). 

Figure 2 is a table which depicts codon usage frequencies in genes which are 
5 highly expressed and in the codon optimized gagpol open reading frame of the HIV 
packaging construct described herein. 

Figure 3 is a schematic representation of the HIV provirus and a three-plasmid 
expression system used for generating a pseudotyped HIV-based vector by transient 
transfection as described in Naldini et al. % Science, 272:263-267 (1996). 
10 Figure 4 is a list of some characteristics relating to the HIV Rev protein. 

Figure 5 is a list of some points relating to codon optimization of HIV gagpol. 

Figure 6 is a partial DNA sequence of HIV gag (SEQ ID NO: 1), showing 
inactivation of inhibitory sequences as described in Schwartz, S. et a!.. J. Virol., 
tftf(12):7 176-7 182 (1992). * 
15 Figure 7 a plot of the %(G+C) content of wildtype HIV gagpol sequences and 

theoretically codon optimized HIV gagpol sequences. The percent of bases, either G or 
C, was calculated for a 30 nucleotide moving window for the entire length of the gagpol 
gene, and the value plotted versus nucleotide position. Diamonds = HIV gagpol 
sequences; squares = full optimal back-translation for gag open reading frame; 
20 triangles - full optimal back-translation for pol open reading frame; CO = codon 
optimized. 

Figures 8A-8E depict the alignment of the nucleotide sequences and predicted 
ammo acid sequences for the gag coding region of a wildtype HIV gagpol and a codon 
optimized HIV gagpol. "NL4-3 genbank.SEQ" indicates the nucleotide sequence (SEQ 
25 ID NO:2 ) and predicted amino acid sequence (SEQ ID NO: 3) for the gag coding region 
of a wildtype HIV gagpol. "pHDMHcpm2.seq" indicates the nucleotide sequence (SEQ 
ID NO:4) and predicted amino acid sequence (SEQ ID NO:5) for the gag coding region 
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of a codon optimized HIV gagpol. The "NL4-3 genbank.SEQ" sequences are publicly- 
available at the NIH GenBank sequence repository (Aceesssion No. Ml 9921). 

Figures 9A-9E depict the alignment of the nucleotide sequences and predicted 
amino acid sequences for the pol coding region of a wildtype HIV gagpol and a codorr 
5 optimized HIV gagjwl. "NL4-3 genbank.SEQ" indicates a nucleotide sequence fSEQ 
ID NO:6) and a predicted amino acid sequence (SEQ ID NO:7) for the pol coding 
region of a wildtype HIV gagpol available in the NIH GenBank sequence repository 
( Aceesssion No. Ml 9921). The nucleotide and amino acid sequences for the pol coding 
region available in the GenBank sequence repository contain two sequence errors, 

10 which are indicated in Figures 9A-9L with shading. "pKL4-3.scq" indicates the correct 
nucleotide sequence (SEQ ID NO:8) and predicted amino acid sequence (SEQ ID 
NO:9) for the pol coding region of a wildtype HIV gagpol. "pHDMHgpm2.seq" 
indicates the nucleotide sequence (SEQ ID NO: 10) and predicted amino acid sequence 
( SEQ ID NO:l 1) for the poLcoding region of a codon optimized HIV gagpol. 

15 Figures 10A-10D depict the DNA sequence (SEQ ID NO: 12) for pHDMHgpm2. 

The CMV enhancer/promoter is at nucleotides 97 to 679, human betaglobin sequences 
(Bgiobin) are at nucleotides 761 to 864, 865 to 1303 and 5710 to 6469 (end of Bglobin 
is at nucleotdes 6445 to 6469), mRNA sequences are at nucleotides 680 to 778 and 1255 
to 5921, SV40 origin of replication is at nucleotides 8796 to 8908. beta-lactamasc (bla) 

20 coding region is at nucleotides 6709 to 7569, intron sequences are at nucleotides 779 to 
1254, the codon optimized gag coding region is at nucleotides 1318 to 2820, the codon 
optimized pol coding region is at nucleotides 2619 to 5624 and the poly A site is at 
nucleotides 5897 to 592 E 

Figure 1 1 is a circular map of plasmid pHDMHgpm2. 

25 DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to novel packaging cell lines useful for generating 
viral accessory protein independent lentivirus-derived, particularly HIV-derived, 
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retroviral vector panicles, to construction of such cell lines and to methods of using the 
accessory protein independent lentivirus-derived retroviral vector particles to introduce 
DXA of interest into cells ('e.g. eukaryotic cells such as animal (particularly 
mammalian), plant or yeast cells or prokaryotic cells such as bacterial cells). In a 
5 particular embodiment, the packaging cell lines of the present invention are stable 
packaging cell lines. 

The cell lines are engineered to express the lentivirus proteins necessary for 
virus particle formation (gagpol proteins), without containing DNA sequences from 
lentivirus accessory proteins (tat. vif, vpr. vpu, nef and rev proteins and Rev response 

10 element (RREVt. Additionally, no viral sequences (such as cis-acting elements termed 
constitutive transport elements (CTEs)) will be expressed as RNA of any kind. DNA 
sequences for lentivirus gagpol are codon optimized by extensively mutagenizing the 
sequences to improve expression and to reduce the risk of recombination between 
transfer vector sequences and gagpol messenger RNA. This greatly improves the safety 

15 of virus preparations generated from these cell lines. In a particular embodiment, the 
DNA sequences for lentivirus gagpol are not codon optimized in the overlap region 
between the gag and pol sequences and in cis-acting signals necessary for translation of 
pol. 

Examples of lentiviruses include human immunodeficiency viruses (e.g.. HIV-1, 
20 HIV-2. HIV-3), bovine lentiviruses (e.g., bovine immunodeficiency viruses, bovine 
immunodeficiency-like viruses, Jembrana disease viruses), equine lentiviruses (e.g., 
equine infectious anemia viruses), feline lentiviruses (e.g., feline immunodeficiency 
viruses, panther lentiviruses. puma lentiviruses), ovine/caprine lentiviruses (e.g.. 
Brazilian caprine lentiviruses, caprine arthritis-encephalitis viruses. Maedi-Visna 
25 viruses. Maedi-Visna-like viruses. Maedi-Visna-reiated viruses, ovine lentiviruses. 
Visna lentiviruses), Simian AIDS retroviruses (e.g.. human T-cell lymphotropic virus 
tvpe 4). simian immunodeficiency viruses, simian-human immunodeficiency viruses, 
human lymphotrophic viruses (e.g.. type III), simian T-cell tymphotrophic viruses. 
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In another embodiment, cell lines are engineered to express the HIV proteins 
necessary for virus particle formation (gagpol proteins), without containing DNA 
sequences from HIV accessory proteins (tat, vif. vpr, vpu. nef and rev proteins and Rev- 
response element (RRE)). Additionally, no viral sequences (such as cis-acting elements 
5 termed constitutive transport elements (CTEs)) will be expressed as RNA of any kind. 
DNA sequences for a HIV gagpol are codon optimized by mutagenesis to improve 
expression and to reduce the risk of recombination between transfer vector sequences 
and gagpol messenger RNA. In a particular embodiment, the DNA sequences for HIV 
gagpol are not codon optimized in the overlap region between the gag and pol 
10 sequences and in cis-acting signals necessary for translation of pol. 

Alternatively, each of the packaging cell lines described herein can be produced 
using (1) a nucleotide sequence which comprises a codon optimized gag coding 
sequence and (2) a nucleotide sequence which comprises a codon optimized pol coding 
sequence, in place of the nucleotide sequence which comprises a codon optimized 
15 gagpol coding sequence. In this embodiment, the gag and pol coding sequences can be 
completely codon optimized 

Benefits of the present invention include the removal of potentially harmful 
lentivirus accessory proteins and other viral sequences, and the reduction of the risk of 
recombination to produce replication competent virus. 
20 Packaging cell lines for producing a viral accessory protein independent 

lentivirus-derived retroviral vector particles comprise a mammalian cell and a retroviral 
nucleotide sequence comprising a coding sequence for a lentivirus gagpol which has 
been codon optimized. In a particular embodiment the packaging cell lines further 
comprise a retroviral nucleotide sequence comprising a coding sequence for a 
25 heterologous envelope protein. In a second embodiment, the packaging cell lines 

further comprise a retroviral nucleotide sequence comprising a coding sequence for a 
heterologous envelope protein and a retroviral nucleotide sequence which comprises a 
DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse 
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transcription and integration. In third embodiment, the packaging cell lines further 
comprise a retroviral nucleotide sequence which comprises a DNA sequence of interest 
and HIY cis-acting sequences required for packaging, reverse transcription and 
integration. Alternatively, the packaging cell lines of the present invention comprise a 
5 retroviral nucleotide sequence which comprises a codon optimized gag coding sequence 
and (2) a retroviral nucleotide sequence which comprises a codon optimized pol coding 
sequence, in place of the retroviral nucleotide sequence which comprises a codon 
optimized gagpol coding sequence. 

The coding sequence(s) for lentivirus gagpol which has (have) been codon 

10 optimized results in improved expression of the lentivirus gagpol proteins and reduces 
the risk of recombination between the transfer vector and gagpol messenger RNA. 
Codon optimization of the coding sequence(s) for lentivirus gagpol was obtained by 
mutagenizing for each particular amino acid residue, specific nucleic acid bases in a 
codon for the particular ammo acid residue to a nucleic acid base which is present in a 

1 5 codon which occurs at a high frequency in genes which are highly expressed for the 
same amino acid residue. In a particular embodiment, the resulting optimized codon 
also does not cause introduction of mRNA splicing signals into the codon optimized 
sequence. Thus, in a particular embodiment, codon optimization of the coding 
sequence(s) for lentivirus gagpol is obtained by mutagenizing for each particular amino 

20 acid residue, specific nucleic acid bases in a codon for the particular amino acid residue 
to a nucleic acid base that is present in a codon which (1) occurs at a high frequency in 
genes which are highly expressed for the same amino acid residue and (2) does not 
cause introduction of mRNA splicing signals into the codon optimized sequence. 
Codon optimization typically results in the removal of nucleic acid base A-rich 

25 instability elements. 

In a particular embodiment, the coding sequence for a HIV gagpol (pNL4-3; 
available through the AIDS repository, NIH; Adachi et al., J. Virol.. 59:284-291 (1986)) 
has been codon optimized to improve translational efficiency of the HIV gagpol 
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proteins and reduce the risk ol" recombination between the transfer vector and HIV 
gagpol messenger RNA. Two hundred thirty-seven base pairs (237 bp j consisting of the 
gag pol overlap and cis-acting signals necessary for translation of pol (nucleotides 2583 
to 28 1 c > of SEQ ID NO: 12} were not optimized. The HIV gagpol sequence obtained ~ 
5 using the codon optimization process does not differ at the amino acid level from the 
wildtype HIV gagpol sequence, but differs at the nucleotide level from the HIV gagpol 
sequence. A codon optimized HIV gag sequence is shown in Figures 8A-8E 
(pHDMHgpm2.seq) (SEQ ID NO:4). A codon optimized HIV pol sequence is shown in 
Figures 9A-9L (pHDMHgpm2.seq) (SEQ ID NO: 10). 

10 A piasmid comprising DNA sequences which encode codon optimized lentivirus 

gagpol proteins is also referred to herein as a packaging construct. This piasmid 
includes a promoter which drives the expression of the gagpol proteins, such as the 
human cytomegalovirus (hCMV) immediate early promoter. This piasmid is defective 
for the production of the viral envelope and accessory proteins tat, vif, vpr, vpu. nef and 

1 5 rev and the Rev response element (RRE). The packaging construct also does not 

contain viral sequences which are transcribed into mRNA. such as constitutive transport 
elements (CTEs). 

A packaging construct comprising a codon optimized HIV gagpol is depicted in 
Figure 1 and in Figure 1 1 . Figures 10A-10D depict the DNA sequence (SEQ ID 

20 NO: 1 2) for the packaging construct pHDMHgpm2. This packaging construct 
(pHDMHgpm2) was constructed as follows: Piasmid pMDA.HIVgp mam was 
generated by chemical synthesis and PCR assembly (which is described in, for example. 
Stemmer et al, Gene, 164:49-53 (1995)) of 21 5 different oligonucleotides. The DNA 
sequence for pMDA.HIVgp mam is the same as the DNA sequence for 

25 pMDA.HIVgp jtg except for 4.3 kb which was codon optimized using the DNAStar 
program (LaserGene, Madison, WI). Two hundred thirty-seven base pairs (237 bp) 
consisting of the gag pol overlap and cis-acting signals necessary for translation ot pol 
(nucleotides 2583 to 2819 of SEQ ID NO: 12) were not optimized due to dual reading 
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frame constraints. A Nsil site 5" of IN was preserved to aid fusion with wildtype 
sequences. Several single or double base pair silent mutations were introduced either to 
prevent potential spiiee donors and acceptors, or by the synthesis process. 
pMDA.MIVcp jtg was derived from HIV-1 strain NF4-3. The protease mutation tharis 
5 present in the NL4-3 NIH GenBank sequence was then repaired (Figure 9B). changing 
the nucleotide present at position 2948 of SEQ ID NO: 12 from a "G" to a "C". thereby 
producing the codon present at nucleotide positions 2948 to 2950 of SEQ ID NO: 1 2 
which encodes an arginine instead of the glycine present in the NL4-3 GenBank amino 
acid sequence. The resulting plasmid was named pMDHgpmam. The EeoRI-Hindlll 

■ 0 fragment of pMDHgpmam was inserted into pHDM2b. a high copy version of the pMD 
vector (Ory. D. era/.. Proc. Natl. Acad. Sci. USA, 93(2 1): 1 1400-1 1406 ( 1 996)). to 
produce plasmid pHDMHgpm. The sequencing mutation that is present in the RNase 
domain of the NI.4-3 NIH GenBank sequence was repaired (Figure 9H), changing the 
codon present at nucleotide positions 4724 to 4726 of SEQ ID NO: 12 from "GGG" to 

15 "AAG", thereby producing a codon encoding a lysine instead of the glycine present in 
the NL4-3 GenBank amino acid sequence. The resulting plasmid was named 
pHDMHgpm2. Codon usage frequencies in the codon optimized gagpol open reading 
frame of the packaging construct pHDMHgpm2 are shown in Figure 2. 

As used herein, a heterologous envelope protein permits pseudotyping of 

20 particles generated by the packaging construct and includes the G glycoprotein of 
vesicular stomatitis virus (VSV G) and the amphotropic envelope of the Moloney 
leukemia virus (MLV). A plasmid comprising a DNA sequence which encodes a 
heterologous envelope protein is also referred to herein as an envelope coding plasmid. 
The terms "mammal" and "mammalian", as used herein, refer to any vertebrate 

25 animal, including monotremes. marsupials and placental, that suckle their young and 
either give birth to living young (eutharian or placental mammals) or are egg-laying 
(metatharian or nonplacental mammals). Examples of mammalian species include 
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humans and other primates (e.g.. monkeys, chimpanzees), rodents (e.g., rats, mice, 
guinea pigs) and ruminents (e.g., cows, pigs, horses). 

Examples of mammalian cells include human (such as He-La cells. 293T cells. 
NIH 3T3 cells), bovine, ovine, porcine, murine (such as embryonic stem cells), rabbit ~ 
5 and monkey (such as COS1 cells) cells. The cell may be a non-dividing eel! (including 
hepatocytes, myofibers, hematopoietic stem cells, neurons) or a dividing cell. The cell 
may be an embryonic cell, bone marrow stem cell or other progenitor cell. Where the 
cell is a somatic cell, the cell can be, for example, an epithelial cell, fibroblast, smooth 
muscle cell, blood cell (including a hematopoietic cell, red blood cell. T-celL B-celL 

10 etc.). tumor cell, cardiac muscle cell, macrophage, dendritic cell, neuronal cell (e.g., a 
glial cell or astrocyte), or pathogen-infected cell (e.g., those infected by bacteria, 
viruses, virusoids, parasites, or prions). 

Typically, cells isolated from a specific tissue (such as epithelium, fibroblast or 
hematopoietic cells) are categorized as a "cell-type." The cells can be obtained 

15 commercially or from a depository or obtained directly from an animal, such as by 
biopsy. Alternatively, the cell need not be isolated at all from the animal where, for 
example, it is desirable to deliver the virus to the animal in gene therapy. 

To produce the cell lines of the present invention for producing a viral accessory- 
protein independent lentivirus-derived retroviral vector particles, mammalian host cells 

20 are co-transfected with (a) a first plasmid comprising DNA sequence which encode 
lentivirus gagpol proteins, wherein said DNA sequence has been codon optimized by 
mutagenisis. as described above, to improve expression of the lentivirus gagpol 
proteins; and (2) a second plasmid comprising a DNA sequence which encodes a 
heterologous envelope protein, or a retroviral nucleotide sequence which comprises a 

25 DNA sequence of interest and lentivirus cis-acting sequences required for packaging, 
reverse transcription and integration, or both, under conditions appropriate for 
transfection of the cells. 
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In a particular embodiment, to produce the cell lines of the present invention for 
producing viral accessory protein independent HIY-denved retroviral vector panicles 
mammalian host cells were eotranslected with (a) a first piasmid comprising ONA 
sequence which encode 1 HY gagpol proteins, wherein said DNA sequence has been 
5 codon optimized by mutagenisis. as described above, to improve expression of the HIV 
gagpol proteins; and (2) a second piasmid comprising a DNA sequence which encodes a 
heterologous envelope protein, or a retroviral nucleotide sequence which comprises a 
DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse 
transcription and integration, or both, under conditions appropriate for transfection of 
10 the ceils. 

Virus stocks consisting of viral accessory protein independent lentivirus-derived, 
particularly HlV-derived. retroviral vector particles of the present invention are 
produced by maintaining the transfected cells under conditions suitable for virus 
production (e.g.. in an appropriate growth media and for an appropriate period of time). 

1 5 Such conditions, which are not critical to the invention, are generally known in the art. 
See. e.g., Sambrook ct a! , Molecular Cloning: A Laboratory Manual. Second Edition, 
Cold Spring Harbor University Press, New York (1989); Ausubel et ai. Current 
Protocols in Molecular Biology. John Wiley & Sons, New York (1998); U.S. Patent 
No. 5.449.614; and U.S. Patent No. 5.460,959, the teachings of which are incorporated 

20 herein by reference. 

To generate viral accessory protein independent lentivirus-derived retroviral 
vector particles, mammalian host cells can be co-transfected with (a) a first piasmid 
comprising DNA sequence which encode lenti virus gagpol proteins, wherein said DNA 
sequence has been codon optimized by mutagenisis, as described above, to improve 

25 expression of the lenti virus gagpol proteins; (b) a second piasmid comprising a DNA 
sequence which encodes a heterologous envelope protein; and (c) a third piasmid 
comprising a DNA sequence of interest and lentivirus cis-acting sequences required for 
packaging, reverse transcription and integration. Alternatively, mammalian cells are 
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transfected with a plasmid comprising a codon optimized DNA sequence encoding a 
lentivirus gag protein and a plasmid comprising a codon optimized DNA sequence 
encoding a lenti virus pol protein, in place of the first plasmid comprising a codon 
optimized DNA sequence encoding both lentivirus gagpol proteins. Alternatively, 
5 mammalian host cells are transfected with a plasmid comprising a codon optimized 
DNA sequence encoding a lentivirus gag protein and a plasmid comprising a codon 
optimized DNA sequence encoding a lentivirus pol protein, in place of the first plasmid 
comprising a codon optimized DNA sequence encoding both lentivirus gagpol proteins. 
In a particular embodiment, the invention relates to methods of producing viral 

1 0 accessory protein independent HIV-derivcd retroviral vector particles, comprising co- 
transfecting mammalian host cells with (a) a first plasmid comprising DNA sequence 
which encode HIV gagpol proteins, wherein said DNA sequence has been codon 
optimized by mutagenisis, as described above, to improve expression of the HIV gagpol 
proteins; (b) a second plasmid containing a DNA sequence which encodes a 

1 5 heterologous envelope protein; and (c) a third plasmid comprising a DNA sequence of 
interest and HIV cis-acting sequences required for packaging, reverse transcription and 
integration. Alternatively, mammalian host cells are transfected with a plasmid 
comprising a codon optimized DNA sequence encodmg a HIV gag protein and a 
plasmid comprising a codon optimized DNA sequence encoding a HIV pol protein, in 

20 place of the first plasmid comprising a codon optimized DNA sequence encoding both 
HIV gagpol proteins. 

Virus particles produced by the methods described herein, using a codon 
optimized HIV packaging construct produced as described herein, were compared by 
Western analysis with virus particles produced as described in Naldini et al.. Science, 

25 272:263-261 (1996), using the packaging construct plasmid pCMVAR8.2. Both the 
immunological reactivity and the proteolytic processing were confirmed to be 
indistinguishable. 
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A plasmiii comprising a DNA sequence of interest and HIV eis-acting sequences 
required lor packaging, reverse transcription and integration is aiso re 1 erred to herein as 
a transfer vector. A transfer vector, as used herein, refers to a vehicle which is used to 
introduce a DNA of interest into a eurkaryotic cell, particularly a mammalian cell. 
5 figure 3 depicts an example of a transfer vector. 

DNA sequence of interest, as used herein, include all or a portion of a gene or 
eenes encoding a nucleic acid product whose expression in a ceil or a mammal is 
desired. In a particular embodiment, the nucleic acid product is a heterologous 
therapeutic protein. Examples of therapeutic proteins include antigens or immunogens. 

! 0 such as a polyvalent vaccine, cytokines, tumor necrosis factor, interferons, interleukins. 
adenosine deaminase, insulin. T-cell receptors, soluble CD4. growth factors, such as 
epidermal growth factor, human growth factor, insulin-like growth factors, fibroblast 
growth factors), blood factors, such as Factor VIII. Factor IX. cytochrome b. 
glucocerebrosidase. ApoE/ApoC, ApoAI. the LDL receptor, negative selection markers 

15 or "suicide proteins", such as thymidine kinase (including the MSV, CMV, VZV TK), 
anti-angiogenic factors. Fc receptors, plasminogen activators, such as t-PA, u-PA and 
streptokinase, dopamine. MHC. tumor suppressor genes such as p53 and Rb. 
monoclonal antibodies or antigen binding fragments thereof, drug resistance genes, ion 
channels, such as a calcium channel or a potassium channel, adrenergic receptors. 

20 hormones (including growth hormones) and anti-cancer agents. In another embodiment, 
the nucleic acid product is a gene product to be expressed in a cell or a mammal and 
which product is otherwise defective or absent in the cell or mammal. For example, the 
nucleic acid product can be a functional gene(s') which is defective or absent in the cell 
or mammal. 

25 DNA sequence of interest includes DNA sequences (control sequences) which 

are necessary to drive the expression of the gene or genes. The control sequences are 
operably linked to the gene. The term "operably linked", as used herein, is defined to 
mean that the gene is linked to control sequences in a manner which allows expression 
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of the gene (or the nucleic acid sequence). Generally, operably linked means 
contiguous. 

Control sequences include a transcriptional promoter, an optional operator 
sequence to control transcription, a sequence encoding suitable mRNA ribosomal 
5 binding sites and sequences which control termination of transcription and translation. 
In a particular embodiment, a recombinant gene encoding a desired nucleic acid product 
can be placed under the regulatory control of a promoter which can be induced or 
repressed, thereby offering a greater degree of control with respect to the level of the 
product produced. 

1 0 As used herein, the term "promoter" refers to a sequence of DNA, usually 

upstream (5') of the coding region of a structural gene, which controls the expression of 
the coding region by providing recognition and binding sites for RNA polymerase and 
other factors which may be required for initiation of transcription. Suitable promoters 
are well known in the art. Exemplary promoters include the SV40, CMV and human 

1 5 elongation factor (EFI) promoters. Other suitable promoters are readily available in the 
art (see. e.g., Ausubel et al, Current Protocols in Molecular Biology, John Wiley & 
Sons, Inc., New York (1998); Sambrook et al. Molecular Cloning: A Laboratory 
Manual. 2nd edition. Cold Spring Harbor University Press, New York (1989): and U.S. 
Patent No. 5.681,735). 

20 A DNA sequence of interest can be isolated from nature, modified from native 

sequences or manufactured de novo, as described in, for example, Ausubel et al., 
Current Protocols in Molecular Biology, John Wiley & Sons, New York (1998); and 
Sambrook et al, Molecular Cloning: A Laboratory Manual 2nd edition. Cold Spring 
Harbor University Press, New York. (1989). DNA sequences can be isolated and fused 

25 together by methods known in the art, such as exploiting and manufacturing compatible 
cloning or restriction sites. 

The packaging cell lines and viral particles of the present invention can be used, 
in vitro, in vivo and ex vivo, to introduce DNA of interest into a eukaryotic cell (e.g., a 
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rr.ammalian cell) or a mammal (e.g.. a human or nther mamma! or vertebrate). The cells 
can be obtained commercially or iron: a depository or obtained direct]} from a mammal, 
such as bv biopsy. The cells can be obtained from a mammal to whom they will be 
returned or from another/different mammal of the same or different species. For 
5 example, using the packaging ceil lines or viral panicles of the present invention. DNA 
of interest can be introduced into nonhuman cells, such as pig cells, which are then 
introduced into a human. Alternatively, the cell need not be isolated from the mammal 
where, for example, it is desirable to deliver vial particles of the present invention to the 
mammal in gene therapy. 

i() Ex viva therapy has been described, for example, in Kasid et aL, Proc. Sad. 

Acad. Sci. USA, £ 7 :473 (1990); Rosenberg et ai, S. Engl. J. Med.. 323:570 (1990); 
Williams tf/ a/., Nature, 310:476 ( 1 984); Dick et aL Cell, 42:7\ (1985); Keller et aL 
Nature, 318: 149 (1985); and .Anderson et aL United States Patent No. 5,"99.346. 

Methods for administering (introducing) viral particles directly to a mammal are 

1 5 generally known to those practiced in the art. For example, modes of administration 
include parenteral, injection, mucosal, systemic, implant, intraperitoneal, oral, 
intradermal, transdermal (e.g.. in slow release polymers), intramuscular, intravenous 
including infusion and/or bolus injection, subcutaneous, topical, epidural, etc. Viral 
particles of the present invention can. preferably, be administered in a pharmaceutical^ 

20 acceptable carrier, such as saline, sterile water. Ringer's solution, and isotonic sodium 
chloride solution. 

The dosage of a viral particle of the present invention administered to a 
mammal, including frequency of administration, will vary depending upon a variety ot 
factors, including mode and route of administration; size, age, sex. health, body weight 
25 and diet of the recipient mammal; nature and extent of symptoms of the disease or 
disorder being treated; kind of concurrent treatment, frequency of treatment, and the 
effect desired. 
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The teachings of all the articles, patents, patent applications and GenBank 
sequences cited herein are incorporated by reference in their entirety. 

While this invention has been particularly shown and described with references 
to preferred embodiments thereof, it will be understood by those skilled in the art that" 
various changes in form and details may be made therein without departing from the 
spirit and scope of the invention as defined by the appended claims. 
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CLAIMS 

What is claimed is: 

1 . A packaging cell line for producing a viral accessory proiein independent 11IV- 
derived retroviral vector particle comprising: 

a) a mammalian cell; 

b) a first retroviral nucleotide sequence in the cell which comprises a 
coding sequence for a HIV gagpoi wherein said coding sequence has 
been mutagenized to improve expression of the HIV gagpoi proteins: 

c) a second retroviral nucleotide sequence in the ceil which comprises the 
coding sequence for a heterologous envelope protein; and 

d) a third retroviral nucleotide sequence in the ceil which comprises a DNA 
sequence of interest and HIV cis-acting sequences required for 
packaging, reverse transcription and integration. 

2. A packaging cell line of Claim 1 wherein the heterologous envelope protein is 
the G glycoprotein of vesicular stomatitis virus (VSV G). 

3. A packaging cell line of Claim 1 wherein the heterologous envelope protein is 
the amphotropic envelope of the Moloney leukemia virus. 

4. A packaging cell line of Claim 1 wherein the DNA sequence of interest encodes 
a heterologous therapeutic protein. 

5. A packaging cell line comprising: 
a) a mammalian cell; 
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b) a first retroviral nucleotide sequence in the cell which comprises a 
coding sequence ior a HIV gagpol. wherein said coding sequence has 
been mutagenized to improve expression of the HIV gagpol proteins; and 

c) a second retroviral nucleotide sequence in the cell which comprises a ~ 
DNA sequence of interest and HIV cis-acting sequences required for 
packaging, reverse transcription and integration. 

6. A packaging cell line of Claim 5 wherein the DNA sequence of interest encodes 
a heterologous therapeutic protein. 

7. A packaging cell line comprising: 

a) a mammalian cell; 

b) a first retroviral nucleotide sequence in the cell which comprises a 
coding sequence for a HIV gagpol, wherein said coding sequence has 
been mutagenized to improve expression of the HIV gagpol proteins; and 

c) a second retroviral nucleotide sequence in the cell which comprises the 
coding sequence for a heterologous envelope protein. 

8. A method of producing a packaging cell line for producing a viral accessory 
protein independent HIV-derived retroviral vector particle, comprising co- 
transfecting mammalian host cells with: 

a) a first plasmid comprising a DNA sequence which encodes HIV gagpol 
proteins, wherein said DNA sequence has been mutagenized to improve 
expression of the HIV gag and pol proteins; 

b) a second plasmid comprising a DNA sequence which encodes a 
heterologous envelope protein; and 
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c) a third plasmid comprising a I)\A sequence of interest and HIY cis- 
aetme sequences required for packaging, reverse transcription and 

integration. 

9. A method of Claim 8 wherein the heterologous envelope protein is the G 
5 glycoprotein of vesicular stomatitis virus (VSV G). 

10. A method of Claim 8 wherein the heterologous envelope protein is the 
amphotropic envelope of the Moloney leukemia virus. 

11. A method of Claim 8 wherein the DNA sequence of interest is a heterologous 
therapeutic protein. 

10 12. A method of producing a viral accessory protein independent HIV-derived 

retroviral vector particle comprising co-transfecting mammalian host cells with: 
a) a first plasmid comprising a DNA sequence which encodes HIV gagpol 
proteins, wherein said DNA sequence has been mutagenized to improve 
expression of the HIV gagpol proteins; 
15 b) a second plasmid comprising a DNA sequence which encodes a 

heterologous envelope protein; and 
c) a third plasmid comprising a DNA sequence of interest and HIV cis- 
acting sequences required for packaging, reverse transcription and 
integration. 



20 13. 



A method of Claim 12 wherein the heterologous envelope protein is the G 
glycoprotein of vesicular stomatitis virus (VSV G). 
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14. A method of Claim 12 wherein the heterologous envelope protein is the 
amphotropie envelope of the Moloney leukemia virus. 

15. A method of Claim 12 wherein the DNA sequence of interest eneodes a 
heterologous therapeutic protein. 

5 16. A packaging cell line for producing a viral accessory protein independent 
lentivirus-derived retroviral vector particle comprising: 

a) a mammalian cell; 

b) a first retroviral nucleotide sequence in the cell which comprises a 
coding sequence for a lentivirus gagpoL wherein said coding sequence 

10 has been mutagenized to improve expression of the lentivirus gagpol 

proteins; 

c) a second retroviral nucleotide sequence in the cell which comprises the 
coding sequence for a heterologous envelope protein; and 

d) a third retroviral nucleotide sequence in the cell which comprises a DNA 
i 5 sequence of interest and lentivirus cis-actmg sequences required for 

packaging, reverse transcription and integration. 

17. A packaging cell line of Claim 16 wherein the heterologous envelope protein is 
the G glycoprotein of vesicular stomatitis virus (VSV G). 

18. A packaging cell line of Claim 16 wherein the heterologous envelope protein is 
20 the amphotropic envelope of the Moloney leukemia virus. 



19. 



A packaging cell line of Claim 16 wherein the DNA sequence of interest 
encodes a heterologous therapeutic protein. 
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A packaging cell line compnsing: 

a) a mammalian cell: 

b) a first retroviral nucleotide sequence in the cell which comprises a 
coding sequence for lentivirus gagpol. wherein said coding sequence has 
been mutagenized to improve expression of the lentivirus gagpol 
proteins; and 

c) a second retroviral nucleotide sequence in the cell which comprises a 
DNA sequence of interest and lentivirus cis-acting sequences required 
for packaging, reverse transcription and integration. 

A packaging cell line of Claim 20 wherein the DNA sequence of interest 
encodes a heterologous therapeutic protein. 

A packaging cell line comprising: 

a) a mammalian cell; 

b) a first retroviral nucleotide sequence in the cell which comprises a 
coding sequence for lentivirus gagpol, wherein said coding sequence has 
been mutagenized to improve expression of the lentivirus gagpol 
proteins; and 

c) a second retroviral nucleotide sequence in the cell which comprises the 
coding sequence for a heterologous envelope protein. 

A method of producing a packaging cell line for producing a viral accessory 
protein independent lentivirus-derived retroviral vector particle, comprising co- 
trans fecting mammalian host cells with: 

a) a first plasmid comprising a DNA sequence which encodes lentivirus 
gagpol proteins, wherein said DNA sequence has been mutagenized to 
improve expression of the lentivirus gag and pol proteins; 
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b) 



a second plasmid comprising a DNA sequence which encodes a 
heterologous envelope protein; and 

a third plasmid comprising a DNA sequence of interest and lentivirus 
cis-acting sequences required for packaging, reverse transcription and - 



5 



integration. 



24. A method of Claim 23 wherein the heterologous envelope protein is the G 
glycoprotein of vesicular stomatitis virus (VSV G). 

25. A method of Claim 23 wherein the heterologous envelope protein is the 
amphotropic envelope of the Moloney leukemia virus. 

10 26. A method of Claim 23 wherein the DNA sequence of interest is a heterologous 
therapeutic protein. „ 

27. A method of producing a viral accessory protein independent ientivirus-derived 
retroviral vector particle comprising co-transfecting mammalian host cells with: 



a) 



a first plasmid comprising a DNA sequence which encodes lentivirus 



15 



gagpol proteins, wherein said DNA sequence has been mutagenized to 
improve expression of the lentivirus gagpol proteins; 
a second plasmid comprising a DNA sequence which encodes a 
heterologous envelope protein; and 

a third plasmid comprising a DNA sequence of interest and lentivirus 



b) 



20 



cis-acting sequences required for packaging, reverse transcription and 
integration. 



28. A method of (Maim 27 wherein the heterologous envelope protein is the G 
glycoprotein cf vesicular stomatitis virus ( VSV G). 
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29. A method of Claim 27 wherein the heterologous envelope protein is the 
amphotropic envelope of the Moloney leukemia virus. 

30. A method of Claim 27 wherein the DNA sequence of interest encodes a 
heterologous therapeutic protein. 

5 31. A viral accessory protein independent HIV-denved retroviral vector particle 

produced by the method comprising co-transfecting mammalian host cells with: 
a) a first plasmid comprising a DNA sequence which encodes HIV gagpal 

proteins, wherein said DNA sequence has been mutagenized to improve 

expression of the HIV gagpol proteins; 
10 b) a second plasmid comprising a DNA sequence which encodes a 

heterologous envelope protein; and 
c) a third plasmid comprising a DNA sequence of interest and HIV cis- 

acting sequences required for packaging, reverse transcription and 

integration. 

1 5 32. A method of Claim 3 1 wherein the heterologous envelope protein is the G 
glycoprotein of vesicular stomatitis virus (VSV G). 

33. A method of Claim 3 1 wherein the heterologous envelope protein is the 
amphotropic envelope of the Moloney leukemia virus. 



34. 

20 



A method of Claim 3 1 wherein the DNA sequence of interest encodes a 
heterologous therapeutic protein. 
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35. A viral accessory protein independent lentivirus-dcrived retroviral vector 
particle produced hy the method comprising co-transfecting mammalian host 
cells with: 

a) a first plasmid comprising a DNA sequence which encodes lentivirus - 
gagpol proteins, wherein said DNA sequence has been mutageni/ed to 
improve expression of the lentivirus gagpol proteins; 

b) a second plasmid comprising a DNA sequence which encodes a 
heterologous envelope protein: and 

c) a third plasmid comprising a DNA sequence of interest and lentivirus 
cis-acting sequences required for packaging, reverse transcription and 
integration. 

36. A method of Claim 35 wherein the heterologous envelope protein is the G 
glycoprotein of vesicular stomatitis virus (VSV G). 

37. A method of Claim 35 wherein the heterologous envelope protein is the 
amphotropic envelope of the Moloney leukemia virus. 

38. A method of Claim 35 wherein the DNA sequence of interest encodes a 
heterologous therapeutic protein. 

39. Isolated DNA encoding a codon optimized HIV gagpol. 

40. Isolated DNA encoding a codon optimized HIV gag. 

41. Isolated DNA of Claim 40 comprising the nucleotide sequence of SEQ ID NO:4. 

42. Isolated DNA encoding a codon optimized HIV pol. 
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4.v Isolated DNA of Claim 42 comprising the nucleotide sequence of SEQ ID 
NO: 10. 

44. A method of introducing a DNA sequence of interest into a mammal comprising 
introducing into said mammal a viral accessor}' protein independent HIV- 

5 derived retroviral vector particle comprising the DNA sequence of interest. 

45. The method of Claim 44 wherein the mammal is a human. 

46. The method of Claim 44 wherein the DNA sequence of interest encodes a 
heterologous therapeutic protein. 

47. A method of introducing a DNA sequence of interest into a mammal comprising 

10 the steps of: 

a) introducing into cells a viral accessor.' protein independent HIV-derived 
retroviral vector particle comprising the DNA sequence of interest; and 

b) returning the cells obtained in step a) to the mammal. 

48. The method of Claim 47 wherein the mammal is a human. 

15 49. The method of Claim 47 wherein the DNA sequence of interest is a heterologous 
therapeutic protein. 
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Rev 

• Regulates HIV gene expression by promoting 
cytoplasmic levels of unspliced and singly spliced 
mRNAs 

• Postulated to affect splicing, stability, transport, 
and translation 



Fig. 4 
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Codon Optimization of HIV gagpol 

• Remove A-rich instability elements 

• Improve translational efficiency 

• Reduce risk of recombination with 
transfer vector 

Fig. 5 
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Alignment Report c f Zoccr oc:—Z3Ucn . gag; ME 3. using Costa, meihca -v.t." 'es.aue weight tac:e 
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AJignment Report of Codon optimization (gag). MEG, using Ciustal methca with PAM250 residue weight table. 
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Alignment Recort of Codon optimization ^gagJ.MEG. using Clustal method with PAM250 residue weight table 
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CCA 


CCA 


GAA 


GAG AGC 




AGG 


TTT 








2190 
















2220 
i 












2197 


G E 




T 




T P 


5 


Q 


K 


Q 2 


p 




D 


NL4-3 genbank. 


SEQ 


2187 


GGG GAA 


GAG 


ACA 


ACA 


ACT CCC 




CAG 


AAG 


CAG GAG 


CCG 


ATA 


GAC 






2714 


G E 


£ 


T 


T 


T p 


5 


Q 


K 


Q E 


p 






?HDMHgpm2 - seq 




2^14 


GGG GAA 


GAG 


ACA 


ACA 


ACT CGC 


TCT 


CAG 


.AAG 


v_r\.G ■jj-Mj 




ATA 


GAC 







Fig. 8D 
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Ajignment Report of Codon optimization (gag). MEG. usng Clustai .nethco with 3AM25Q r 



eaaue weight tabie 



2:22 K Z I Y ? 1 

iL^z2 AAG GAA CTG TAT CCT TTA 

2~59 K E L Y ? L 

2 759 AAG GAA CTG TAT C 



: TCC CTC AGA TCA 
L A S L R 3 
« TA oCT TCC CTC AGA TCA 



AGC 



2280 



2277 D ? 5 S Q ■ . 

22 77 GAC CCC TCG TCA CAA TAA 

2304 D ? S S Q 

2 9 04 GAC CCC TCG TCA CAA TAA 



NL4-3 genbank.S 
pKDMHgprr2 . seq 



Fig. 8E 
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Alignment ReDCrt of Cocion Optimization (pel). MEG, using Oustal method with PAM250 residue weight table 





2 0 9 0 


















2 


12 0 










2097 


F 




R 




D 




A 


F 


p 


Q 


G 


V 


.A 


B 




NL4-3 genbank 


2087 






AGG 


GAA 


GAT 


CTG 


GCC 


TTC 


CCA 


CAA 




AAa 




AGG 


'jAA 




2035 


F 






E 


D 


L 


A 




P 




G 




A 


B 


E 


pML4 - 3 . s eq 


20*35 






AGG 


GAA 


GAT 


CTG 




T^r 


CCA 


CAA 


GGG 




GCC 


AGG 


GAA 




2 6L2 


IT 






E 


D 




A 




P 


Q 


G 




A. 


a 


E 


pHDMHgpm2 . seq 


26L2 


TTT 




AGG 


GAA 


GAT 


CTG 




TTC 


CCA 


CAA 


'j j j 


AA 0 


oc c 


AGG 


GAA 
















2150 




















2132 


F 


S 




£ 


Q 


T 


B 


A 


N 


S 


P 


T 


R 


B 




NL4-3 genbank; 


2132 




TCT 


TCA 


GAG 


CAG 


ACC 


AGA 


GCC 


AAC 


AGC 


ceo 


A : — 


AGA 


AGA 


GAG 




21 30 




5 




E 




T 


B 


A 


U 


5 


p 


T 


p 


R 




pNL4-3 . seq 


2130 






TCA 


GAG 


CAG 


AC 0 


AGA 


G0C 


AAC 


A'o^ 


cc ~ 


>*.■.. 


a :-a 


AGA 


GAG 




26.57 




5 


3 


E 


2 


T 


R 


A 


N 


5 


p 


T 


B 


B 


IT 


pHDMHgpra2 . seq 


265^ 


TTT 




TCA 


GAG 


2 AG 


AC Z 


AGA 


GCC 


AAC 


AGC 


0 0 0 


AGO 


A.GA 


AGA 


GAG 






2130 




















2 1 0 










21^-7 


L 


Q 


V 


W 




B 


D 


N 


N 


5 




3 


? 


A 


G 


NH-0 genbank 




CTT 


CAG 


GTT 


TGG 


GGA 


AGA 


GAC 


AAC 


AAC 


TCC 


ot: 


T-0A 


jA^. 


GCA 


G'GA 




2175 


L 


Q 


V 


W 




B 


25 


N 


tl 


5 


L 


3 




A 




pNL4-3 . seq 


21^5 


CTT 


CAG 


GTT 


TGG 


G'GA 


AGA 


GAC 


AAC 


AAC 


TCC 


CTC 


T : 0A 


^AA 


GCA 


GGA 




2702 




Q 


V 


W 


'j 


B 


D 


M 


N 


S 


L 


3 


- 


A 




pHDMKgpm2 • seq 


2702 


CTT 


CAG 


GTT 


TGG 


GGA. 


AGA 


0AC 


AAC 


AAC 


TC 0 




T OA 


■0AA 


GCA 


GGA 
















2 


r 

240 

i 




















-i i o i 


A 


D 


B 


Q 




T 


V 


3 


F 


5 




p 




0 


T 


NL4-0 genbank 


i n i 


GCC 


GAT 


AGA 


CAA 


3-GA 


ACT 


OTA 




TTT 


AGO 


TTC 


COT 


CAG 


ATC 


ACT 




2220 


A 


D 


B 


Q 




T 


V 


3 




S 




o 




I 




pNL4- 3 . seq 


2220 


GCC 


GAT 


AGA 


CAA 


G* jA 


ACT 


GTA 






AGO 


i * 


CCT 


'0AG 


ATC 


ACT 




2^ 


A 


D 


R 


Q 




T 


V 


5 


r 


5 




p 






T 


pHDMHgpm2 . seq 


2 n 4" 


GCC 


GAT 


AGA 


CAA 


0GA 


ACT 


GTA 


TCC 




AGO 


TTC 




GAG 


ATC 


ACT 






2 


T ■' 

270 


















i 

23C0 










226~ 


L 


W 


Q 


R 


? 


L 


V 


T 


I 


K 


I 


'j 


G 






ML4-3 genbank 


226" 


CTT 


TGG 


CAG 


CGA 


CCC 


CTC 


GTC 


a:a 


ATA 


AAG 


ATA 




OGG 


CAA 


TTA 




2265 


L 


W 


Q 


B 


D 


L 


V 


T 




K 




■ j 




c 


L 


pNL4-3 . sec 


0 2 65 


CTT 


TGG 


CAG 


CGA 




CTC 




A OA 


ATA 


AAG 


ATA 






CAA 


TTA 




2792 


L 


W 


Q 


R 


p 


L 


V 


T 


I 


K 




G 




Q 




pHDMHgpmz . seq 


2792 


CTT 


TGG 


CAG 


CGA 




CTC 


GTC 


ACA 


ATA 


AAO 


ATC 


J»J J. 




CAG 


CTG 





2330 



1312 K E A L L D T G A D D T 7 L E NL4-3 genbank. SEQ 

?312 AAG GAA GCT CTA TTA GAT ACA GGA GCA GAT GAT ACA GTA TTA GAA 

1210 K E A L L D T GAD D T 7 L E pNL4-3.seq 

:310 AAG GAA GCT CTA TTA GAT ACA GGA GCA GAT GAT ACA GTA TTA GAA 

: 937 KEALLDTGAD0T7LE pHDMHgpm2 . seq 

:S37 AAG GAG GCC CTG CTG GAC ACC GGC GCC GAC GAC AC0 GTG CTG GAG 



Fig. 9A 
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Align™, Reocl of Cccon Op t in,za,cn -poU.MEG. uang Custa, ..erhoa „th PAM250 res.aue we.gm * w .. 



W K ? K m 



M N I ? r, a 



j rtj,-. . GG AAA 



_ K P K M Z ■-: - ..r- - - 

- M - TG /V ' T TTG C ~A GGA AGA TGG AAA CCA AAA A~G A" A ''^^ 

2582 E M K L ? G R W K * v x ' " -° 

2B82 GAG ATG AAC CTG CCC GGC CGC TGG AAG CCC AAG ATG ATC — -- ?d ^^ ' 5 e ~ 



GGC 



2420 



2402 I G G F 



K V G Q Y D Q 



4^2 ATT GGA GGT TTT A~~ AAA r^a ^ra r*- — ^ ^ " 1 N: - 4 "2 genban* . SE ; 

~ UtA jGA GAT CAG ATA C^C A~A 

t4,j0 G ? I K V R Q y p g r t * 

2400 ATT GGA GGT TTT ATC AAA. GTA AGA CAG TAT GAT CAG A^A -C ^ P ^ " ^ 

*- 92 ' 1 G G f : k v a q * d o - : : 

292^ ATC GGC GGC T~C V : — J„ " * " 1 pKDMKgp.T^ . s e - 

.-v, 



- 2 il C 24 8 0 



2447 E I c G 



K A I G T V 



2447 GAA ATC TGC GGA CAT AAA GCT ATA GGT ACA GTA TTA GTA GGA CCT ' ?er — SE; 

' EIi -GHKAIGTVLV~d 

ATC TGC GGA W AAA GCT ATA GGT ACA GTA TTA GTA GGA C~- P™' 3 ' 
2,12 E I C G H K A T Q , v , - I „ 

2*72 GAG ATC TGC GGC CAC -AAG GCC ATC GGC ACC GTG CTG GTG GGC CCC PHBMH9pn2 



2 4 92 


T 


P 


V 


24 92 


ACA 




GTC 


2490 


T 


P 


V 


24 90 


ACA 


CCT 


GTC 


3C17 


T 


P 


V 


3CI7 


ACC 


CCC 


GTG 




2 


540 

i 




2537 


C 


T 


L 


25 37 


TGC 


ACT 


TTA 


25 35 


C 


T 


L 


25 35 


TGC 


ACT 


TTA 


3 062 




T 


L 



2510 
1 



IlGRN LLTO'rMi^^ ■ , 
™ *- ( jNL4-^ oenDanV 
TA ATT r,r.a n „ „ m ^ uc.iudaf.,it v 

T GGC 

G pNL4-3.sea 



1 I G R N L L T Q 



tGRNLLTQ 



G pHDKHgpm2 . seq 



2570 
L 



J1 T ' rl" *L S ? 1 E T V P V NL4 " 3 genbank.SEQ 
CCC ATT AGT CCT ATT GAG ACT GTA CCA GTA 

N F P 1 s -° I E T V P V DNL4-3 sea 
TTT CCC ATT AGT CCT ATT GAG ACT GTA CCA G-A 

^ ATC TCC CCC AT C GAG ACC GTG r C r~ r^- 



2600 

-582 K L K ? G M ' 



<■ J ° * * L K ? G M D G P K V K n u ^~ 

2587 jna t ta AAr .^ ri rr; . _ m _ ^ w ? nl-1-3 aenbank.SEC 

^c.n " A ^ G ^ T GGC CCA ^ GTT AAA CAA TGG C^A 

KLKPGMDG PKVKO W rnMT ) , 

2580 AAA TTA AAG CCA GGA ATG GAT GGC CCA AAA GTT AAA CAA TGG cr A ~ 3 . seq 

10 KLKPG MDGD KVK ^ tj : 

3107 « G CTG CCC GGC ATG GAC GGC CCC AAA GTC AAG CAG TGG CCC PHDMH *^-"" 



Fig. 9B 
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Alignment Reoort cf Codon Optimization (pol).MEG. js.rg Clustai mevioa with =AM250 'es.due weight table 



2 63C 0 „ 

2 6 d u 



262" 



K I K A L V E 



NL4-3 qenDarrk . SEQ 



2 62 7 TTG ACA GAA GAA AAA ATA AAA GCA TTA GTA GAA ATT TG" AC^ -i- 

2625 L T E E K I K A L V E 2 C * - , 

262S TTG ACA GAA GAA AAA ATA AAA. GCA TTA GTA GAA A- t G t a " ~ ?N ~ 4 - J - se q 

3152 CTG ACC GAG GAG AAG ATC a!g GC= CTG GTG GAG ATC TGC ACC GAG PKDMHgPm2 * 



2690 



2672 


M 


E 


K 


E 


G 


K 


r 


5 


K 


J. 


2672 


ATG 


GAA 


AAG 


GAA 


GGA 


AAA 


ATT 


TCA 


AAA 


ATT 


2670 


M 




K 


E 


G 


K 


r 


s 


K 


r 


267G 


ATG 


GAA 


AAG 


GAA 


GGA 


AAA 


ATT 


TCA 


.AAA 


ATT 


3197 


M 




X 


E 


G 


K 




5 






3137 


ATG 


■GAG 


AAG 


GAG 


'oGC 


AAG 


ATC 






ATC 



G ? E N P genbanJc.SEQ 
-GG CCT GAA AAT C CA 

G P E M ? pNL4-3.seq 
■>GG CCT GAA .AAT CCA 

^ ENp pHDMHgpni2.seq 

CCC GAG AAC CCC 



21 ?° 275C 



2717 


V 


u 


T 


P 


V 




A 


I 


K 


K 


K 


D 


S T 


2717 


TAC 


AAT 


ACT 


CCA 


GTA 


TTT 


GCC 


ATA 


AAG 


AAA 


AAA 


■GAC 


AGT ACT 


2715 


V 


N 


T 


p 


V 


r 


A 


i. 


K 


K 


K 


D 


S T 


2715 


TAC 


AAT 


ACT 


CCA 


GTA 


T^T 


GCC 


ATA 


AAG 


AAA 


AAA 


GAC 


AGT ACT 


3242 


V 


N 


T 


p 


V 




A 


I 


K 


K 


K 


D 


5 T 



K NL4-3 genbank.SEQ 
*AA 

K pNL4- 3. sec 



3242 TAC AAr ar- r-- ^ ™ 5 T K pHDMHgpm2 . seq 

3 2 4^ TAC AAC AC^ CCC GTG TTC GCC ATC AAG AAG AAG GAC TCC ACC AAG 



2780 



2762 


W 


R 


K 


L 


V 


D 




K 




L 


N 


:< 


2762 


TGG 


AGA 


AAA 


TTA 


GTA 


GAT 




AGA 


■GAA 




.AAT 


AAG 


2760 


W 


P. 


X 


L 


V 


D 




R 




L 




K 


2760 


TGG 


AGA 


AAA 


TTA 


GTA 


GAT 




AGA 


GAA 




AAT 


AAG 


3287 


W 


R 


K 


L 


V 


D 


F 


R 






N 


X 


32 9 7 


TGG 


CGC 


AAG 




GTG 


GAC 


TTC 




GAG 


CTG 


.AAC 


AAG 




2310 


















2940 


2907 


D 




W 




V 


Q 


L 




I 


p 


H 


? 


2907 


GAT 


TTC 


TGG 


GAA 


GTT 


GAA 


TTA 


GGA 


ATA 


CCA 


CAT 


CCT 


2805 


D 




W 


E 


V 


Q 








? 


H 


p 


2905 


GAT 


TTC 


TGG 


GAA 


GTT 


CAA 


TTA 


GGA 


ATA 


CCA 


CAT 


CCT 


3332 


D 


F 


W 


E 


V 


Q 




G 


T 


P 


X 


p 


3332 


GAC 




TGG 


GAG 


GTG 


CAG 


CTG 


GGC 


ATC 




CAC 





a T C NL4-3 genbank.SEQ 
\GA ACT CAA 

R T Q pNL4~3.seq 
^GA ACT CAA 

R C Q pHDMHgpm2 .seq 



A G 1 NL4-3 genbar.k . SEQ 
jCA ijG'j TTA 

A G L pNL4 - 3 . seq 
jCA GGG TTA 

A G L pHDMKgpml . seq 



29"0 

2 352 ~ 



3377 



K 


Q 


K 


K 


S 


V 


i 


V 


L 


D 


V 


G 


D 


A 


AAA 


CAG 


AAA 


AAA 


TCA 


GTA 


ACA 


GTA 


CTG 


GAT 


GTG 


GGC 


GAT 


GCA 


K 


Q 


K 


K 


S 


V 




V 


L 


D 


V 


G 


D 


A 


AAA 


CAG 


AAA 


AAA 


TCA 


GTA 


ACA 


GTA 


CTG 


GAT 


GTG 


GGC 


GAT 


GCA 


K 


Q 


K 


K 


S 


V 




V 


L 


D 


V 


G 


D 


A 


AAG 


CAG 


AAG 


AAG 






ACC 


GTG 


CTG 


GAC 


j - G 


GGC 


GAC 


G\ 



Y NL4-3 genbank.SEQ 
FAT 

Y pML4-3.sec 

rAT 

Y pHDMHgpm2 . seq 
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Alignment Revor. of Codcn Cotmizadon ; poi) MEG, js.ng Ciusta, ^ethca with P&M25C rescue we.gnt laoie. 



2 3 97 F 5 7 - * n '-' ^ " ~ ? ^ ~ ~ ~ 

2S ~~ _ . _ . _ . , ^ J - r ^-ba^.si; 

2 9 5 5 F G '/ 5 r - - a ^- - 1 * 

tqo; -n.-. _ „'„'. " rt = pN-4-j.sec 

^ ^ " - '- - - - - — - j-ja : jj->.'_ . TC AGG AAG T. n T A-""" "A mmm — 

3422 E" S V P L D K = r P K v ~* * V u-ht- - 

. j.o ~.G GAC AAG GA.C .T^. CGC AAG TAC ACC GC" 



2960 



»« ACC ATA CCT ^ ATA .4 AAT GAC ACA CCA GGG ATT AGA 4 CAO ""^ -"^'^ 

^^IR^g DNL4-3 se<~ 
2 94 0 ACC ATA CCT AGT ATA AAC AA.T GAG ACA CCA GGG ATT AGA TAT CAG ^ 
34 67 T I ? s t \'N r r pr ~ 

, A .._ _ _ ^ _ " L F G " n * C pHDMHgpm2 . seq 

- i c .■■ A^~ C^ TCC ATC AAC AA.G GAG ACC CCC GGC A.TC GGG TAC CAG 



255C 3C20 

I_ 

NL4-3 ger.bank . SZQ 



298" v N V L ? C G W K G 5 ? A I 

2 98 7 TAC AAT GTG CTT CCA CAG GGA TGG AAA GGA TCA CCA GCA AT^ TTC 

Y H V L ? Q G W K G S P A I F pNL4-3. S eq 

.98 5 TAC AAT GTG CTT CCA CAG GGA TGG AAA. GGA TCA GCA GC* ^TA T m C 

* " V L P 2 = W K G S P A I ; P HDMHgpm2.se q 

--i*. T.*C AAC GTG CTG CCC* CAG GGC TGG AAG G ~,C TCC CCC GCC ATC TTC 



3050 



Q : S M T K I L F. P F R K Q N NL4-3 ger.bank.SEQ 
3 0 3. CAG TGT AGC ATG ACA AAA ATC TTA GAG CGT TTT AGA AAA C\A AA"" 

3030 0 '- S M T K I LEPFRKQN pNL4-3. S eq 

3 0 30 CAG T'jT AGC ATG ACA AAA ATC TTA GAG CCT TTT AGA AAA CAA AAT 

3557 Q C 5 M T K I L F P F R K Q N pHDMHgpm2 . seq 

3 557 CAG TGC TCC ATG ACC AAG ATC CTG GAG CCC TTC CGC AAG CAG AAC 

— ! — , . 

3080 3110 
1 . i 



3077 P D 1 V 1 Y 2 Y M D D L Y V G NL4-3 genfaank.SEQ 
3-7/7 CCA GAC ATA GTC ATC TAT CAA TAC ATG GAT GAT TTG TAT GTA GGA 

3075 P D 1 V 1 Y Q Y M D D L Y V G P NL4-3.seq 



3C7 5 CCA GAC ATA GTC ATC TAT CAA TAC ATG GAT GAT TTG TAT GTA GGA 
36 02 P : I v I y Q Y M D D L Y 
jc^_ G^C ATC GTG ATC TAC CAG TAC ATG GAC GAC GTG TAC G' 



3140 



J D L E 1 3 H R T K - E E L NL4-3 aeru=ank.5EC 

iCl GAC TTA ATA GjG CAG CAT AGA ACA AA.A ATA GAG GAA C^G 

3120 3 D L E 1 - Q H R T K I E E L P NL4-3. S eq 

3120 TCT GAC TTA GAA ATA GGG CAG CAT AGA ACA AAA ATA GAG GAA CTG 

3647 5 ° L E 1 G Q H R T K : E E L pHDMHgpmZ.seq 

3b4 , TCC GAC CTG GAG ATC GGC CAG CAG CGC ACC AAG ATC GAG GAG CTG 



Fig. 9D 
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Al.gnmen. Report of Codon Oprtm lzation (pol) . MEG . us.ng Clustal method w.h PAM250 res-OLe we,gnt tabi( 





3170 














i I 67 


R Q 


H 




L 


R 


W 


G 


3167 


AGA CAA 


CAT 


CTG 




AGG 


TGG 


GGA 


3165 


R Q 


H 


L 




R 


W 


Q 


316 5 


AGA CAA 


CAT 


CTG 




AGG 


TGG 


GGA 


3692 


R Q 


H 


L 


T 


R 


W 


G 


3692 


CGC CAG 


■CAC 


CTG 


CTG 


CGC 


TGG 


GGC 



3200 



NL4-3 genbar.k.SEQ 



F * T P D K K 

TTT ACC ACA CCA GAC AAA AAA. 

F T T P n 
™ * ^ pNL4-3.sea 

"TT ACC ACA CCA GAC AAA AAA 



F T P D K K 

PTC ACC ACC CCC CAC AAG AAG 



pHDMHgpm2 . seq 



3212 



3212 CAT CAG AAA GAA CCT CCA TTC CTT TGG ATG GGT TAT GAA C t C CAT geRbdnk - SEC 

3 - 10 H Q K E P p p t W M G v P 

3210 CAT CAG AAA GAA CCT CCA TTC CTT TGG ATG GGT TAT GAA CT- CA^ PNL4 - 3 ' Se< * 

3 ' 57 H Q * E P P F L ff M G Y P t r 

3-2.7 CAC CAG AAG GAG CCC CCC TTC CTG TGG ATG GGC 7 AC r^G rT V " P*^DMHgpra2 .sec 



CAC 



3260 



3157 


P 


D 


K 


W 


T 


V 


Q 


3257 


CCT 


GAT 


AAA 


TGG 


ACA 


GTA 


CAG 


3255 


P 


D 


K 


W 


T 


V 


Q 


32 5 5 


CCT 


GAT 


AAA 


TGG 


ACA 


GTA 


CAG 


3782 


P 


D 


K 


W 


T 


V 


Q 


3752 


CCC 


GAC 


AAG 


TGG 


ACC 


GTG 

% 


CAG 



3290 



p 



E K D 
GTG CTG CCA GAA AAG GAC 
V L P £ K 

CCA GAA AAG GAC 
? - V L P £ K D 
CCC ATC GTG CTG CCC GAG AAG GAC 



NL4-3 genbank . SE£ 



* L K D PML4-3. seq 

■^T ATA GTG CTG CCA GAA KTxc r.^r H 

pHDMHgpm2 . seq 



3 320 

3302 ~~s W T V N 



"ol T T f T T G f A | A ^ G ™ ™ ™ ^ ttg aIt nm " genbark SEC 

3300 AGC TGG ACT GTC AAT GAC ATA CAG AAA TTA GTG GGA AAA TTG AAT P" 14 " 3 - 3 ^ 
38:7 T « ™ G GTG aL GAC ATC CAG AAG CTG GTG GGC AAG CTG AAC ' Se * 



3350 339C 



334" 



Q I 



3347 TGG GCA AGT CAG ATT t^t r..~* ^ ™ ... ™ ^_ L G NL4_3 genbank.SEQ 



334 5 



Q I 



V 


A 


G 


I 


K 


V 


R 


Q 


L 


r 1 


TAT 


GCA 


GGG 


ATT 


AAA 


GTA 


AGG 


CAA 


TTA 


TGT 


y 


A 




r 


K 


V 




Q 


L 


C 


TAT 


GCA 


GGG 


ATT 


AAA 


GTA 


A3G 


CAA 


TTA 


TGT 


y 


A 


G 


I 


K 


V 


P 


Q 


T, 


C 



3345 TGG GCA AGT CAG ATT r.™ ... . '-_ _ C PN"-3.seq 

TGG GCC TCC CAG ATC TK GCC GGC ATC AAA CTC 4 CAG C" ' ' ^ 



TGC 



339. 
3392 
3390 
3390 
3917 



3410 



G 


T 


K 


A 


L 


T 




V 


V 


GGA 


ACC 


AAA 


GCA 


CTA 


ACA 


GAA 


GTA 


GTA 


G 


T 


K 


A 


L 


T 


E 


V 


V 


GGA 


ACC 


AAA 


GCA 


CTA 


ACA 


GAA 


GTA 


GTA 


G 


T 


K 


A 


L 


T 


E 


V 


V 


GGC 


ACC 


AAG 


GCC 


CTG 


ACC 


GAG 


GTG 


GTG 



NL4-3 genbar.k.SEQ 
pNL4-3 . seq 
pHDMHgpm2 . seq 
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3 4 4 C 



3 4 3 " 

34 3 5 
3435 
3 962 
3562 



GAA 



ACC GAG GAG GCC GAG CTG GAG 



^ N R £ I 1 pNL4-3.seq_ 

CA GAA AAC AGG GAG ATT CTA 

E N R E I L pHDMHgpin2 . sec 

GAG AAC CGC GAG ATG CTG 



3500 



NL4-3 genbar.k . 5E; 



34 6 2 K E P V H 3 V Y Y D P S K 0 

34 3: AAA GAA CCG GTA CAT GGA GTG TAT TAT GAC CCA TCA AAA GAC TTA 

349C K E ? V H jVYYDPSKDL pNL4-3.sec 

34 8 0 AAA GAA CCG GTA CAT GGA GTG TAT TAT GAC CCA TCA AAA GAC TTA 

4CC7 K E ? Y H G V Y Y D P 5 K D L pKDMHgp-J; . sea 

4 0 07 AAG GAG CCC GTG CAC GGC GTG TAC TAC GAC CCC TCC AAG GAC CTG 



3530 



I 



•*' Q 3 Q G Q W T Y Q ML4-3 genbank . SEC. 

352 7 ATA GCA GAA ATA CAG AAG CAG GCG CAA GGC CAA TGG ACA TAT CAA 

352 5 I A E I Q K Q G Q G 2 W T Y Q pNL4-3.seq 

3 52 5 ATA GCA GAA ATA CAG AAG CAG GGG CAA GGC CAP. TGG ACA TAT CAA 

4 052 I A E I Q ^ K Q G Q G Q W T Y Q ?HDMHgpm2 . seq 
4 052 AT C GCC GAG ATC CAG * AAG CAG GGC CAG GGC CAG TGG ACC TAC CAG 



1 

3590 



3572 I Y Q E P F K N L K T G Y. Y A NL4-3 ger.bank . SEC 

357 2 ATT TAT CAA GAj CCA TTT AAA AAT CTG .AAA ACA GGA AAA TAT GCA 

3570 I YQEPFKNLXTGKYA pNL4-3.seq 

357 0 ATT TAT CAA GA^ CCA TTT AAA AAT CTG AAA ACA GGA AAA TAT GCA 

4097 IYCEPFKNLKTGKYA pHDMHgpm2 . seq 

4 097 ATC TAC CAG GAG CCC TTC AAG AAC CTG AAG ACC GGC .AAA TAC GCC 



3620 3650 
1 . - 1 

3617 RMKGAHTNDVKQLTE NL4-3 genbar.k . SEC. 

3617 AGA ATG AAG GGT GCC CAC ACT AAT GAT GTG AAA CAA TTA ACA GAG 

3615 RMKGAHTNDVKQLTE pNL4-3.seq 

3615 AGA ATG AAG GGT GCC CAC ACT AAT GAT GTG AAA CAA TTA ACA GAG 

4 142 R M K G A H T X C V K Q L T E pHCMHcprrJ . sec 

4142 CGC ATG AAG GGC GCC CAC ACC .AAC GAC GTG .AAG CAG CTG ACC GAG 

i 

3680 

3662 AVOKIATESIVIWGK NL4-3 genban k . SEC. 

3 662 GCA GTA CAA AAA ATA GCZ ACA GAA AGC ATA GTA ATA TGG GGA .AAG 

3660 AVQKIATESIVIWGK pNL4-3.seq 

3 660 GCA GTA CAA AAA ATA GCC ACA GAA AGC ATA GTA ATA TGG GGA AAG 

4 1 8 7 A V Q K 1 A T E S _ I V I W G K pH.DMHgpm2.seq 
413" GCC GTG CAG AAG ATC GC : ACC GAG TCC ATC GTG ATC TGG GGC AAG 
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3710 



374 0 



3707 
3707 
3705 
37 0 5 
4232 
4232 



AC" 



? K F K L P I Q Y. E T W £ A 
CCT AAA. TTT AAA. TTA CCC ATA CAA AA.G GAA ACA TGG GAA GCA 

? F K L P I Q y E T W E A 

CCT AAA TTT AAA TTA CCC ATA CAA AAG GAA. ACA TGG GAA GCA 
PKFKLPIQKETWEA 



CCC AA.G TTC AA.G CTG CCC ATC CAG AAG GAG ACC TGC 



NL4-3 qer.bar.k . SEC 



cNL4 -: 



DHDMHc:cm2 . seer 



GAG 



! 

377C 



HI 2 WW TEYWQATW:p EWe NL4 . 3 ge , bank . SE ; 

3/52 TGG TGG ACA GAG TAT TGG CAA GCC ACC TGG ATT CCT GAG TGG GAG 

3750 WWTEYWQA7WIPEWE D NL4-3.seq 

3750 TGG TGG ACA GAG TAT TGG CAA GCC ACC TGG ATT CCT GAG TGG GAG 

4277 W W T E Y W Q A 7 W I P E W E pHDMHaprrJ: . sea 

4 2/.- TGG TGG ACC GAG TAC TGG CAG GCC ACC TG Z ATC CCC GAG TGG GAG 



3800 



3 8 3 0 



3797 
3797 
3795 
3795 
4322 
4322 



3842 
3342 
3840 
384 0 
4367 
4367 



F V N T P P L V K L W r Q L £ NL4 _ 3 genbank>SSQ 

TTT GTC AAT ACC CCT CCC TTA GTG AAG TTA TGG TAC CAG TTA GAG 

FV NTPPLVKLWr Q LE pHL4-3.seq 

TTT GTC AAT ACC CCT CCC TTA GTG AAG TTA TGG TAC CAG TTA GAG 

F v N T P ? L v P: L W r Q L E pHDMKgpn^.sec 

TTC GTG AA3 ACC CCC C :c CTG GTG AAG CTG TGG TAC CAG CTG GAG 



1 

3860 



K E P 1 1 13 A E T F ^ V D G A NL4-3 genbank.SEC 

AAA GAA C:C ATA ATA GGA GCA GAA ACT TTC TAT GTA GAT GGG GCA 

K E ? I I G A E T F Y V D G A pNL4-5.seq 

AAA GAA CCC ATA ATA GGA GCA GAA ACT TTC TAT GTA GAT GGG GCA 

K E ? 1 1 G A E T F Y V D G A DHDMKgpm2 . sec 

AAG GAG CCC ATC ATC GGC GCC GAG ACC TTC TAC GTG GAC GGC GCC 



3890 



392!) 



3887 ANRETKLGKAGYVTD 
3 88 7 GCC AAT AGG GAA ACT AAA TTA GGA AAA GCA GGA TAT GTA ACT GAC 
38 85ANRETKLGKAGYVTD 
38 8 5 GCC AAT AGG GAA ACT AAA TTA GGA AAA GCA GGA TAT GTA ACT GAC 
44 12 AN R ET K L G KAG Y VT D 
44 12 GCC AAC CGC GAG ACC AA.G CTG GGC AAG GCC GGC TAC GTG ACC GAC 



1 

3950 



3932 R G ?. Q K V V ? L T D T T N Q 

3 932 AGA GGA AGA CAA AAA GTT GTC CCC CTA ACG GAC ACA ACA AAT CAG 
3930 RGRQKVVPLTDTTNC 
393 0 AGA GGA AGA CAA AAA GTT GTC CCC CTA ACG GAC ACA ACA .AAT CAG 
4457 RGRQKVVPLTDTTNQ 

4 4 57 CGC GGC CGC CAG AAG GTG GTG CCC CTG ACC GAC ACC ACC AAC CAG 



NL4-3 genbank.SEC 
PNL4-3 . seq 
pHDMHgpm2 .seq 

NL-J-3 genbar.k. . 5EQ 
pNL4 - 3 . seq 
pHDMHgpm2 . seq 
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3S8C 4CIC 

N'24-3 jer.car.it.SE 
pN'L- - 3 . sec 



3 c-*- 


:-: 7 


Z 






A I 
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2 
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j 


3? — 


AAG ACT 


GAG 




CPA 


GCA ATT 


CA™ 


CTA 




» «. o 




CAT 




uGA 


3 9 " 5 


K T 








A I 


u 




A 








3 






AAG ACT 




TTA 


CAA 


GCA ATT 


CAT 


CTA 






CAG 


GAT 




jGA 


4 5 02 


K T 


C 


L 




A I 


w 




A 


T 


c 




S 


j 


4532 


AAG ACC 


GAG 


CT G 


CA.G 


GCC ATC 


CAC 


CTG 


GCC 


CTG 


CAA 


GAC 


TCC 














l 

4 u 4 U 

1 


















4022 


L E 


V 


N 


I 


V T 


D 


S 


Q 


Y 


A 


L 




T 


4C22 


TTA GAA 


GTA 


AAC 


ATA 


GTG ACA 


GAC 


TCA 


CAA 


TAT 


GCA 




GGA 


ATC 


4C20 


L E 


V 


N 


T 


V T 


D 


S 


Q 


V 


A 




G 




4 C2 0 


TTA GAA 


GTA 


AAC 


ATA 


GTG ACA 


GAC 


TCA 


CAA 


TAT 


GCA 


TTG 


GGA 


ATC 


4547 


L £ 


V 


N 


T 


V T 




S 


Q 


V 


A 


L 


G 




454" 


CTG GAG 


GTG 


AAC 


ATC 


GTG ACC 


GAC 


TCC 


CAG 


TAT 


GCA 


TTG 




ATC 




4070 
















4 


~l 

IOC 

! 








4 067 


: c 


A 


Q 


P 


D K 










L 


V 


S 




406^ 


ATT CAA 


GCA 


CAA 


CCA 


GAT AAG 


AGT 


GAA 


TCA 


GAG 


TTA 


GTC 


AGT 


CAA 


4065 


I Q 


A 


Q 


P 


D K 


S 




S 


E 




V 


S 




4 065 


ATT CAA 


GCA 


CAA 


CCA 


GAT AAG 


AGT 


GAA 


TCA 


GAG 


TTA 


GTC 


AGT 


CAA 


4592 


: Q 


A 


Q 


P 


D K 


S 


E 


S 


E 


L 


V 


S 




45S2 


ATC CAG 


GCC 


CAG 


:cc 


GAC AAG 


TCC 


GAG 


TCC 


GAG 


CTG 


GTG 


TCC 


CAG 



NL4-3 gent an k . SEC 
pNL4-3 . see 
pKDMHgpm2 .sec 

NL4-3 genbank.SEQ 



4130 



41:2 IIEQLIKKEKVYLAW NL4-3 genbar.k.SEQ 

41i2 ATA ATA GAG CAG TTA ATA AAA AAG GAA AAA GTC TAC CTG GCA TGG 

4H0 II EQLI KKEKVYLAW pNL4-3.seq 

4110 ATA ATA GAG CAG TTA ATA AAA AAG GAA AAA GTC TAC CTG GCA TGG 

4637 IIE Q L IKKEKVY LAW pHDMHgp.-n2.seq 

4 637 ATC ATC GAG CAG CTG ATC AAG AAG GAG AAG GTG TAC CTG GCC TGG 



4160 4ig 0 



4157 VPAHKGIGGNEQVDG NL4-3 genbank.SEQ 

4157 GTA CCA GCA CAC AAA GGA ATT GGA GGA AAT GAA CAA GTA GAT GGG 

4155 VPAHKGIGGNEQVDK pNL4-3.sea 

4155 GTA CCA GCA CAC AAA GGA ATT GGA GGA AAT GAA CAA GTA GAT AAG 

4682 VPAHKGIGGNEQVDK pHDMHgr>rn2 . sea 

4 6a 2 GTG CCC GCC CAC AAG GGC ATC GGC GGC AAC GAG CAG GTG GAC AAG 



1 

4220 



4202 LVSAGIRKVLFLDGI NL4-3 genbank.SEQ 

42 C 2 TTG GTC AGT GCT GGA ATC AGG AAA GTA CTA TTT TTA GAT GGA ATA 

4200 LVSAGIRKVLFLDGI cNL4-3.seq 

42 0 0 TTG GTC AGT GCT GGA ATC AGG .AAA GTA CTA TTT TTA GAT GGA ATA 

4727 LVSAGIRKVLFLDGI pHDMHgpm2 . sec 

4 72 7 CTG GTG TCC GCC GGC ATC CGC AAG GTG CTG TTC CTG GAC GGC ATC 
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4250 



428C 



4247 
4247 
4245 
42 4 5 
4772 
4772 



4292 
4292 
4290 
4290 
4^1~ 
4617 



gat aag gcc caa gaa gaa cat gag aaa tat cac agt aat tgg aga 
dkaqeehekyhsnw'r 

GAT AAG GCC CAA GAA GAA CAT GAG AAA TAT CAC AGT AAT TGG AGA 

DKA QEEHEKYHSNWR 
GAC AAG GCC CAG GAG GAG CAC GAG AAG TAC CAC TCC AA.C TGG CGC 



NL4-2 genbank.SEQ 



pNL4 - j . 5 ec 



4310 



4 3 37 
4337 
4335 
4235 
4862 
4862 



4352 
4382 
4380 
4 3-8 0 
4907 
4907 



4 42 7 
4427 
4425 
4425 
4952 
4952 
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AAC 


CTG 


ccz 


CCC 


GTG 


GTG 


GCC 


.AAG 


GAG 


434 C 


















1 

4 37 0 

i 








: v 


A 


5 


C 


D 


K 




Q 


L 


K 


G 




A 


M 


ATA GTA 


GCC 
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AAA 


TGT 
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CTA 


AAA 


GGG 


GAA 
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ATG 
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S 


C 


D 
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E 


A 


M 


ATA GTA 


GCC 
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TGT 


GAT 


AAA 


TGT 


CAG 


CTA 


AAA 


GGG 


GAA 


GCC 


ATG 


I V 


A 


S 


C 


D 


K 


C 


Q 


L 


K 


G 


E 


A 


M 


ATC GTG 


GCC 


TCC 


TGC 


GAC 


AAG 


TGC 


CAG 


CTG 


AAG 


GGC 


GAG 




ATG 


l ■ — ■ 

4400 

i 


H G 


Q 


V 


D 


C 


S 


P 


G 


I 


W 


Q 


T 


D 


C 


CAT GGA 


CAA 


GTA 


GAC 


TGT 


AGC 


CCA 


GGA 


ATA 


TGG 


CAG 


CTA 


GAT 


TGT 


H G 


Q 


V 


D 


C 


5 


P 


G 


I 


W 


Q 


L 


Ei 


C 


CAT GGA 


CAA 


GTA 


GAC 


TGT 
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CCA 


GGA 


ATA 


TGG 


CAG 


CTA 


GAT 


TGT 


H G 
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D 


C 
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p 


G 
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Q 
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C 


CAC GGC 


CAG 


GTG 


GAC 


TGC 


TCC 


CCC 


GGC 


ATC 


TGG 


CAG 


CTG 


GAC 


TGC 


~ 1 ■ r - - 

4430 
i 


















I 

4460 
i 








T H 
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GAA 
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AAA 


GTT 
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TTG 


GTA 
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GTT 
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GTA 


GCC 


T H 
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L 
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ACA CAT 


TTA 


GAA 


GGA 


AAA 


GTT 
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GTT 


CAT 


GTA 
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T H 
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V 
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ACC CAC 


CTG 


GAG 


GGC 


AAG 


GTG 


ATC 


CTG 


GTG 


GCC 


GTG 


CAC 


GTG 


GCC 



4490 



4472 
4472 
4470 
4470 
4997 
4997 



SGYIEAEVIPAETGQ 
AGT GGA TAT ATA GAA GCA GAA GTA ATT CCA GCA GAG ACA GGG CAA 

SGYIEAEVIPAETGQ 
AGT GGA TAT ATA GAA GCA GAA GTA ATT CCA GCA GAG ACA GGG CAA 

SGYIEAEVIPAETGQ 
TCC GGC TAC ATC GAG GCC GAG GTG ATC CCC GCC GAG ACC GGC CAG 



pHDMHgpm2 . sec 

NL4-3 genbank.SEQ 
pNL4 -3 . seq 
pHDMHgpm2 . seq 

NL4-3 genbanx . SEQ 
PNL4-3 .seq 
pHDMKgpm2 .see 

NL4-3 genbank.SEQ 
pNL4-3 . seq 
pHDMHgpm2 . seq 

NL4-3 genbank.SEQ 
pNL4-3 . seq 
pHDMHgpru2 . seq 

NL4-3 genbank.SEQ 
pNL4-3 . seq 
pHDKHgpm2 . seq 
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■j G A 
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G 
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5 042 
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4 


58G 


















4 5 62 
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V 
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T 


rj 


N 




s 


N 




T 




T 


T 

1 


456Z 


AAA. A.. A 


GTA 


CAT 


ACA 


GAC 


AAT 


G GC 


AGC 


AAT 


TTC 


ACC 


AGT 


ACT 


ACA 


4560 


K T 


V 


H 




D 


N 




S 


N 


F 




S 




T 


4560 


AAA ACA 


GTA 


CAT 


ACA 


GAC 


AAT 


GGC 


AGC 


AAT 




ACC 


AGT 


ACT 


ACA 


5087 


K 7 


V 


H 




D 


N 


G 


S 


N 




T 


s 


T 


T 


508- 


AAG AC C 


GTG 


CAC 


ACC 


GAC 


AAC 


GGC 


TCC 


.AAC 




ACC 


TCC 


ACC 


ACC 




461 D 


















464 C 








4607 


v :-■ 


A 


A 


C 


W 


W 


A 




I 


K 


C 


£ 


F 


G 


4607 


GTT AAG 


GCC 


GCC 


TGT 


TGG 


T 


GCG 




ATC 


AAG 


CAG 


GAA 




tj*jC 


46C5 


V K 
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A 
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W 


w 


A 
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K 


Q 


E 




G 


4605 


GTT AAG 


GCC 


GCC 


TGT 


TGG 


TGG 


GCG 


G jG 


ATC 


AAG 


CAG 


GAA. 




GGC 


5132 


V K 
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C 


W 


W 


A 


' 3 


7 


K 
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E 


F 




5132 


GTG AAG 


GCC 


GCC 


TGC 


TGG 

4 


TGG 


GCC 


GGC 


ATC 


AAG 


CAG 


GAG 


TTC 


GGC 



ptiL4 - 3 . s 



N"L4-3 gencar.k . SE^ 
pNL4 - 3 . s eq 
pHDMHgpm2 . seq 



4670 



4652 I e*NP QS QSVIESMN NL4-3 genoank.SEQ 

4 652 ATT CCC TAC AAT CCC CAA AGT CAA GGA GTA ATA GAA TCT ATG AAT 

4650 1 P Y N P Q S Q G V I E S M N pNL4-3.seq 

4650 ATT CC: TAC AAT CCC CAA AGT CAA GGA GTA ATA GAA TCT ATG AAT 

5177 * PYN PQS 2GVIESMN pHDMHgprr.2 . seq 

517? ATC CCC TAC AAC CCC CAG TCC CAG GGC GTG ATC GAG TCC ATG AAC 



47Q0 473c 



4657 KELKKIIGCVRDQAE NL4^3 aenbank.SEQ 

4 6 97 AAA GAA TTA AAG AAA ATT ATA GGA CAG GTA AGA GAT CAG GCT GAA 

4 655 KELKKIIGC'VRDQAE pNL4-3.seq 

4 6 95 AAA GAA TTA AAG AAA ATT ATA GGA CAG GTA AGA GAT CAG GCT GAA 

5222 K E L K K I I G C V R D Q A E pHDMKgpmZ . sec 

52^2 AAG GAG CTG AAG AAG ATC ATC GGC CAA GTC CGC GAC CAG GCC GAG 



4760 

4742 ~ « T A V Q M A V F i H N F~ NL4-3 aenbank.SEC 

4 7 42 CAT CTT AAG ACA GCA GTA CAA ATG GCA GTA TTC ATC CAC AAT TTT 

4 7 40 K L K T A V Q MAY F I K n f- 

4 74 0 CAT CTT AAG ACA GCA GTA CAA ATG GCA GTA TTC ATC CAC AAT TTT 
5267 HLKTAVQMAVFI HN F 

5 2 6" CAC CTG AAG ACC GCC GTG. CAG ATG GC 2 GTG TTC ATC CAC AAC TTC 



pNL4-3 . seq 
pHDMHgpm2 . seq 
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4 877 
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GTG GAC ATC ATC GCC ACC GAC ATC CAG ACC AAG GAG 
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Q K 
CAG AAG 
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4922 
4920 
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5492 
5492 
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AGCTTGGCCC ATTGCATACG TTGTATCCAT ATCATAATAT GTACATTTAT ATTGGCTCAT 60 

GTCCAACATT ACCGCCATGT TGACATTGAT TATTGACTAG TTATTAATAG TAATCAATTA 12 0 

CGGGGTCATT AGTTCATAGC CCATATATGG AGTTCCGCGT TACATAACTT ACGGTAAATG 18 0 

GCCCGCCTGG CTGACCGCCC AACGACCCCC GCCCATTGAC GTCAATAATG ACGTATGT~C 24 0 

CC ATAGTAAC GCCAATAGGG ACTTTCCATT GACGTCAATG GGTGGAGTAT TTACGGTAAA 3 00 

CTGCCCACTT GGCAGTACAT CAAGTGTATC ATATGCCAAG TACGCCCCCT ATTGACGTCA 3 6 0^ 

ATGACGGTAA ATGGCCCGCC TGGCATTATG CCCAGTACAT GACCTTATGG GACTT^CC^A 42 0 

CTTGGCAGTA CATC T AC GTA TTAGTCATCG CTATTACCAT GGTGATGCGG TTTTGGCAGT 48 0 

ACATCAATGG GCGTGGATAG CGGTTTGACT CACGGGGATT TCCAAGTCTC CACCCCATTG 540 

ACGTCAATGG GAGTTTGTTT TGGCACCAAA ATCAACGGGA CTTTCCAAAA TGTCGTAACA 60 0 

ACTCCGCCCC ATTGACGCAA ATGGGCGGTA GGCGTGTACG GTGGGAGGTC TATATAAGCA 66 0 

GAGCTCGTTT AGTGAACCGT CAGATCGCCT GGAGACGCCA TCCACGCTGT TTTGACCTCC 720 

ATAGAAGACA CCGGGACCGA TCCAGCCTCC CCTCGAAGCT GATCCTGAGA ACTTCAGGGT 780 

GAGTCTATGG GACCCTTGAT GTTTTCTTTC CCCTTCTTTT CTATGGTTAA GTTCATGTCA 840 

TAGGAAGGGG AGAAGTAACA GGGTACACA? ATTGACCAAA TCAGGGTAAT TTTGCATTTG 90 0 

TAATTTTAAA AAATGCTTTC TTCTTTTAA? ATACTTTTTT GTTTATCTTA TTTCTAATAC 9 60 

TTTCCCTAAT CTCTTTCTTT CAGGGCAATA ATGATACAAT GTATCATGCC TCTTTGCACC 102 0 

ATTCTAAAGA ATAACAGTGA TAATTTCTGG GTTAAGGCAA TAGCAATATT TCTGCATATA 10 80 

AATATTTCTG CATATAAATT GTAACTGATG TAAGAGGTTT CATATTGCTA ATAGCAGCTA 114 0 

CAATCCAGCT ACCATTCTGC TTTTATTTTA TGGTTGGGAT AAGGCTGGAT TATTCTGAGT 12 00 

CCAAGCTAGG CC CTTTTGCT AATCATGTTC ATACCTCTTA TCTTCCTCCC ACAGCTCCTG 12 60 

GGCAACGTGC TGGTCTGTGT GCTGGCCCAT CACTTTGGCA AAGAATTCTA GACTGCCATG 13 2 0 

GQCGCCCGCG CCTCCGTGCT GTCCGGCGGC G AGC TGG AC A AGTGGGAGAA GATCCGCCTG 13 80 

CGCCCCGGCG GCAAGAAGCA GTACAAGCTG AAGCACATCG TGTGGGCCTC CCGCGAGCTG 1440 

GAGCGCTTCG CCGTGAACCC CGGCCTGCTG GAGACCTCCG AGGGCTGCCG CCAGATCCTG 15 00 

GGCCAGC^GC AGCCCTCCCT GCAAACCGGC TCCGAGGAGC TGCGCTCCCT GTACAACACC 15 6 0 

ATCGC CGTGC TGTACTGCGT GCACCAGCGC ATCGACGTGA AGGACACCAA GGAGGCCCTG 162 0 

GACAAGATCG AGGAGGAGCA GAACAAGTCC AAGAAGAAGG CCCAGCAGGC CGCCGCCGAC 1680 

ACCGGCAACA ACTCCCAGGT GTCCCAGAAC TACCCCATCG TGCAGAACCT GCAGGGCCAG 17 40 

ATGGTGCACC AGGCCATCTC CCCCCGCACC CTGAACGCCT GGGTGAAGGT GGTGGAGGAG 18 0 0 

AAGGCCTTCT CCCCCGAAGT CATCCCCATG TTCTCCGCCC TGTCCGAGGG CGCCACCCCC 1860 

CAGGACCTGA ACACCATGCT GAACACCGTG GGCGGCCACC AGGCCGCCAT GCAGATGCTG 192 0 

AAGGAGACCA TCAACGAGGA GGCCGCCGAG TGGGACCGCC TGCACCCCGT GCACGCCGGC 1980 

CCCATCGCCC CCGGCCAGAT GCGCGAGCCC CGCGGCTCCG ACATCGCCGG CACCACCTCC 2 040 

ACCCTGCAAG AGCAGATCGG CTGGATGACC CACAACCCCC CCATCCCCGT GGGCGAGATC 210 0 

TACAAGCGCT GGATCATCCT GGGCCTGAAC AAGATC GTGC GCATGTACTC CCCCACCTCC 216 0 

ATCCTGGACA TCCGCCAGGG CCCCAAGGAG CCCTTCCGCG ACTACGTGGA CCGCTTCTAC 2220 

AAGACCCTGC GCGCCGAGCA GGCCTCCCAG GAGGTAAAGA ACTGGATGAC CGAGACCCTG 2280 

CTGGTGCAGA ACGCCAACCC CGACTGCAAG ACCATCCTGA AGGCCCTGGG CCCCGGCGCC 2 340 

ACCCTGGAGG AGATGATGAC CGCCTGCCAG GGCGTGGGCG GCCCCGGCCA CAAGGCCCGC 2 400 

GTGCTGGCCG AGGCCATGTC CCAAGTCACC AACCCCGCCA CCATCATGAT CCAGAAGGGC 2460 

AACTTCCGCA ACCAGCGCAA GACCGTGAAG TGCTTCAACT GCGGCAAGGA GGGCCACATC 2520 

GCCAAGAACT GCCGCGCCCC CCGCAAGAAG GGCTGCTGGA AGTGCGGCAA GGAGGGCCAC 25 80 

CAGATGAAAG ATTGTACTGA GAGACAGGCT AATTTTTTAG GGAAGATCTG GCCTTCCCAC 2640 

AAGGGAAGGC CAGGGAATTT TCTTCAGAGC AGACCAGAGC CAACAGCCCC ACCAGAAGAG 2700 

AGCTTCAGGT TTGGGGAAGA GACAACAACT CCCTCTCAGA AGCAGGAGCC GATAGACAAG 27 6 0 

GAACTGTATC CTTTAGCTTC CCTCAGATCA CTCTTTGGCA GCGACCCCTC GTCACAATAA 282 0 



Fig. 10A 



BNSDOCID <WO 0015B19A1 I 



WO 00/15819 



PCT/L'S99/20675 



26/29 



AGATCGGTGG CCAGCTGAAG GAGGCCCTGC TGGACACCGG CGCCGACGAC ACCGTGCTG j 2 88 0 

AGGAGATGAA CCTGCCCGGC CGCTG3AAGC C ZAAGATGAT ^GGCGGCATC GGCGGCTTCA 2 94 0 

TCAAAGTCCG CCAGTACGAZ CAGATTCTGA T7GAGAT-TG :3GCCACAAG GCCATCGGCA 3000 

CCGTGCTGGT 3GGCCCCAC: CCZGTGAACA T:AT:GGCGG CAACCTGCTG ACCCAGATCG 3 06 0 

GCTGCACCCT ^AACTTCCCC ATCTGCCCCA TTGAGACCGT GCCCGTGAAG CTGAAGCCCG 3120 

GCATGGACGG CCCCAAAGTC AAGCAGTGGC CZCTGACCGA GGAGAAGATC AAGGCCCTGG 3 ISO 

TGGAGATCTG CACCGAGATG GAGAAGGAG 3 GTAAGATCTC TAAGATCGGC CCCGAGAACC 3240 

CCTACAACAC CCCCGTGTTC GCCATCAAGA AGAAGGACTC CACCAAGTGG CGCAAGCTGG 33 00 

TGGACTTCCG CGAGCTGAAT AAGCGCACCG AGGACTTGTG GGAGGTGCAG CTGGGCATCC 33 60 

CCCACCCCGC CGGCCTGAAG CAGAAGAAGT CCGTGACGGT GCTGGACGTG GGCGACGCCT 342 0 

ACTTCTCCGT GCCCCTGGAC AAGGACTTCC GC AAGTACAC CGCCTTCACC ATCCCCTCCA 3 48 0 

TCAACAACGA GACCCCCGGC ATCCGCTACC AGTACAACGT GCTGCCCCAG GGCTGGAAGG 3 54 0 

GCTCCCCCGC CATCTTCCAG TGCTCCATGA CCAAGATCCT GGAGCCCTTC CGCAAGCAGA 3 600 

ACCCCGACAT CGTGATCTAC CAGTACATGG ACGACCTGTA CGTGGGCTCC GACCTGGAGA 3 660 

TCGGCCAGCA CCGCACCAAG ATCGAGGAGT TGCGCCAGCA CCTGCTGCGC TGGGGCTTCA 3 72 0 

CCACCCCCGA CAAGAAGCAC CAGAAGGAGC CCCCCTTCCT GTGG AT GGGC TACGAGCTGC 3780 

ACCCCGACAA GTGGACCGTG CAGCCCATCG TGCTGCCGGA GAAGGACTCC TGGACCGTGA 3 84C 

ACGACATCCA GAAGCTGGTG GGCAAGCTGA ACTGGGCCTC CCAGATCTAC GCCGGCATCA 3 900 

AAGTCCGCCA GCTGTGCAAG CTGCTGCGCG GCACCAAGGC CCTGACCGAG GTGGTGCCCC 3 9 60 

TGACCGAGGA GGCGGAGCTG GAGCTGGCCG AGAACCGCGA GATCCTGAAG GAGCCCGTGC 4 02 0 

ACGGCGTGTA CTACGACCCC TCCAAGGACC TGATCGCCGA GATCCAGAAG CAGGGCCAGG 4 0 80 

GCCAGTGGAC CTACCAGATC TACCAGGAGC CCTTCAAGAA CCTGAAGACC GGCAAATACG 414 0 

CCCGCATGAA GGGCGCCCAC ACCAACGACG TGAAGCAjCT GACCGAGGCC GTGCAGAAGA 42 00 

TCGCCACCGA GTCCATCGTG ATCTGGGGCA AGACTCCCAA GTTCAAGCTG CCCATCCAGA 42 60 

AGGAGACCTG GGAGGCCTGG TGGACCGAGT ACTGGCAGGC CACCTGGATC CCCGAGTGGG 43 2 0 

AGTTCGTGAA CP^CCCCCCCC CTGGTGAAGC TGTGGTACCA GCTGGAGAAG GAGCCCATCA 43 8 0 

TCGGCGCCGA GACCTTCTAC GTGGACGGCG CCGCCAACCG CGAGACCAAG CTGGGCAAGG 4440 

CCGGCTACGT GACCGACCGC GGCCGCCAGA AGGTGGTGCC CCTGACCGAC ACCACCAACC 4500 

AGAAGACCGA GCTGCAGGCC ATCCACCTGG CCCTGCAAGA CTCCGGCCTG GAGGTGAACA 4560 

TCGTGACCGA CTCCCAGTAT GCATTGGGCA TCATCCAGGC CCAGCCCGAC AAGTCCGAGT 4 62 0 

CCGAGCTGGT GTCCCAGATC ATCGAGCAGC TGATCAAGAA GGAGAAGGTG TACCTGGCCT 4 680 

GGGTGCCCGC CCACAAGGGC ATCGGCGGCA ACGAGCAGGT GGACAAGCTG GTGTCCGCCG 474 0 

GCATCCGCAA GGTGCTGTTC CTGGACGGCA TCGACAAGGC CCAGGAGGAG CACGAGAAGT 48 00 

ACCACTCCAA CTGGCGCGCC ATGGCCTCCG ACTTCAACCT GCCCCCCGTG GTGGCCAAGG 4860 

AGATCGTGGC CTCCTGCGAC AAGTGCCAGC TGAAGGGCGA GGCCATGCAC GGCCAGGTGG 492 0 

ACTGCTCCCC CGGCATCTGG CAGCTGGACT GCACCCACCT GGAGGGCAAG GTGATCCTGG 49 80 

TGGCCGTGCA CGTGGCCTCC GGCTACATCG AGGCCGAGGT GATCCCCGCC GAGACCGGCC 5 04 0 

AGGAGACCGC CTACTTCCTG CTGAAGCTGG CCGGCCGCTG GCCCGTGAAG ACCGTGCACA 5100 

CCGACAACGG CTCCAACTTC ACCTCCACCA CCGTGAAGGC CGCCTGCTGG TGGGCCGGCA 516 0 

TCAAGCAGGA GTTCGGCATC CCCTACAACC CCCAGTCCCA GGGCGTGATC GAGTCCATGA 522 0 

ACAAGGAGCT GAAGAAGATC ATCGGCCAAG TCCGCGACCA GGCCGAGCAC CTGAAGACCG 528 0 

CCGTGCAGAT GGCCGTGTTC ATCCACAACT TCAAGCGCAA GGGCGGCATC GGCGGCTACT 53 4 0 

CCGCCGGCGA GCGCATCGTG GACATCATCG CCACCGACAT CCAGACCAAG GAGCTGCAGA 54 0C 

AGCAGATCAC CAAGATCCAG AACTTCCGCG TGTACTACCG CGACTCCCGC GACCCCGTGT 546 0 

GGAAGGGCCC CGCCAAGCTG CTGT3GAAGG GCGAGGGCGC CGTGGTGATC CAGGACAACT 552 0 

CCGACATCAA GGTGGTGCCC CGCCGCAAGG CCAAGATCAT CCGCGACTAC GGCAAGCAGA 5580 

TGGCCGGCGA CGACTGCGTG GCCTCCCGCC AGGACGAGGA CTAACACATG GAAAAGATTA 5 64 0 
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GTAAAACACC ATAGGCCGCT CTAGAGGATC 
CCCAGATCTA ATTCACCCCA CCAGTGCAGG 
CTAATGCCCT GGCCCACAAG TATCACTAAG 
AGGTTCCTTT GTTCCCTAAG TCCAACTACT 
CATCTGGATT CTGCCTAATA AAAAACATTT 
TTCTGAATAT TTTACTAAAA AGGGAATGTG 
TGAAGAGCTA GTTCAAACCT TGGGAAAATA 
GAGGCTGCAA ACAGCTAATG CACATTGGCA 
CCTCAGAAAA GGATTCAAGT AGAGGCTTGA 
TTTACATTAC TTATTGTTTT AGCTGTCCTC 
TCCTGCATCT CTCAGCCTTG ACTCCACTCA 
CTGAAGTGTT CCTTCCATGT TTTACGGCGA 
TTAGTTGTCT CTGTTGTCTT ATAGAGGTCT 
TTGACTGTCC TGTGAGCCCT TCTTCCCTGC 
CGACATGGCA GTCTAGATCA TTCTTGAAGA 
TAGGTTAATG TCATGATAAT AATGGTTTCT 
GTGCGCGGAA CCCCTATTTG TTTATTTTTC 
AGACAATAAC CC TGATAAAT GCTTCAATAA 
CATTTCCGTG TCGCCCTTAT TCCCTTTTTT 
CCAGAAACGC TGGTGAAAGT AAAAGATGCT 
ATCGAACTGG ATCTCAACAG CGGTAAGATC 
CCAATGATGA GCACTTTTAA AGTTCTGCTA 
GGGCAAGAGC AACTCGGTCG CCGCATACAC 
CCAGTCACAG AAAAGCATCT TACGGATGGC 
ATAACCATGA GTGATAACAC TGCGGCCAAC 
GAGCTAACCG CTTTTTTGCA CAACATGGGG 
CCGGAGCTGA ATGAAGCCAT ACCAAACGAC 
GCAACAACGT TGCGCAAACT ATTAACTGGC 
TTAATAGACT G GAT GG AG GC GGATAAAGTT 
GCTGGCTGGT TTATTGCTGA TAAATCTGGA 
GC AGCACTGG GGCCAGATGG TAAGCCCTCC 
CAGGCAACTA TGGATGAACG AAATAGACAG 
CATTGGTAAC TGTCAGACCA AGTTTACTCA 
TTTTAATTTA AAAGG AT C T A GGTGAAGATC 
TAACGTGAGT TTTCGTTCCA CTGAGCGTCA 
TGAGATCCTT TTTTTCTGCG CGTAATCTGC 
GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA 
AGCAGAGCGC AGAT AC C AAA TACTGTTCTT 
AAGAACTCTG TAGCACCGCC TACATACCTC 
GCCAGTGGCG ATAAGTCGTG TCTTACCGGG 
GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG 
TACACCGAAC TGAGATACCT ACAGCGTGAG 
AGAAAGGCGG ACAGGTATCC GGTAAGCGGC 
CTTCCAGGGG GAAACGCCTG GTATCTTTAT 
GAGCGTCGAT TTTTGTGATG CTCGTCAGGG 
GGATGCGCCG CGTGCGGCTG CTGGAGATGG 
TGGTTTGCGC ATTCACAGTT CTCCGCAAGA 



CAAGCTTATC GATACCGTCG ACCTCGAGGG 57 0 0 

CTGCCTATCA GAAAGTGGTG GCTGGTGTGG 57 6 0 

CTCGCTTTCT TGCTGTCCAA TTTCTATTAA 5820 

AAACTGGGGG ATATTATGAA GGGCCTTGAG 5880 

ATTTTCATTG CAATGATGTA TTTAAATTAT 5 940 

GGAGGTCAGT GCATTTAAAA CATAAAGAAA 60 0 0 

CACTATATCT TAAACTCCAT GAAAGAAGGT 60 60 

ACAGCCCCTG ATGCCTATGC CTTATTCATC 6120 

TTTGGAGGTT AAAGTTTTGC TATGCTGTAT 6180 

ATGAATGTCT TTTCACTACC CATTTGCTTA 6240 

GTTCTCTTGC TTAGAGATAC CACCTTTCCC 63 0 0 

GATGGTTTCT CCTCGCCTGG CCACTCAGCC 6360 

ACTTGAAGAA GGAAAAACAG GGGGCATGGT 642 0 

CTCCCCCACT C AC AGTG AC C CGGAATCCCT 648 0 

CGAAAGGGCC TCGTGATACG CCTATTTTTA 654 0 

TAGACGTCAG GTGGCACTTT TCGGGGAAAT 6600 

TAAATACATT CAAATATGTA TCCGCTCATG 66 60 

TATTGAAAAA GGAAGAGTAT GAGTATTCAA 672 0 

GCGGCATTTT GCCTTCCTGT TTTTGCTCAC 678 0 

GAAGATCAGT TGGGTGCACG AGTGGGTTAC 6 840 

CTTGAGAGTT TTCGCCCCGA AGAAC GTTTT 69 0 0 

TGTGGCGCGG TATTATCCCG TATTGACGCC 69 60 

TATTCTCAGA ATGACTTGGT TGAGTACTCA 7 02 0 

ATGACAGTAA GAGAATTATG CAGTGCTGCC 7 08 0 

TTACTTCTGA CAACGATCGG AGGACCGAAG 7140 

GATCATGTAA CTCGCCTTGA TCGTTGGGAA 72 00 

GAGCGTGACA CCACGATGCC TGTAGCAATG 72 6 0 

GAACTACTTA CTCTAGCTTC CCGGCAACAA 73 2 0 

GCAGGACCAC TTCTGCGCTC GGCCCTTCCG 73 80 

GCCGGTGAGC GTGGGTCTCG CGGTATCATT 7440 

CGTATCGTAG T TAT C T AC AC GACGGGGAGT 7 5 00 

ATCGCTGAGA TAGGTGCCTC ACTGATTAAG 756 0 

TATATACTTT AG ATTGATT T AAAAC TTCAT 7 62 0 

CTTTTTGATA ATCTCATGAC CAAAATCCCT 7680 

GACCCCGTAG AAAAGATCAA AGGATCTTCT 7740 

TGCTTGCAAA CAAAAAAACC ACCGCTACCA 7800 

CCAACTCTTT TTCCGAAGGT AACTGGCTTC 7860 

CTAGTGTAGC CGTAGTTAGG CCACCACTTC 7 92 0 

GCTCTGCTAA TCCTGTTACC AGTGGCTGCT 7980 

TTGGACTCAA G AC GATAGTT ACCGGATAAG 8040 

TGCACACAGC CCAGCTTGGA GCGAACGACC 810 0 

C TAT GAG AAA GCGCCACGCT TCCCGAAGGG 8160 

AGGGTCGGAA CAGGAGAGCG CACGAGGGAG 822 0 

AGTCCTGTCG GGTTTCGCCA CCTCTGACTT 82 80 

GGGCGGAGCC TATGGAAAAA CGCCAGCAAC 834 0 

CGGACGCGAT GGATATGTTC TGCCAAGGGT 8400 

ATTGATTGGC TCCAATTCTT GGAGTGGTGA 84 60 
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ATCCGTTAGC GAGGTGCCGC CGGCTTCCA7 TCAGGTCGAG GTGGCCCGGC TCCATGCACC 852 0 

GCGACGCAAC GCGGGGAGGC AGACAAGGTA TAGGGCGGCG CCTACAATCC ATGCCAACCC 3 58 0 

GTTCCATGTG CTCGCCGAGG CGGCATAAAT CCCCGTGACG ATCAGCGGTC CAATGATCGA 864 0 

AGTTAGGCTG GTAAGAGCCG CGAGCGATCC TTGAAGCTGT CCCTGATGGT GGTCATCTAC 87 0 0 

CTGCCTGGAC AGCATGGCCT GCAACGCGGG CATCCCGATG CCGCCGGAAG CGAGAAGAAT 876 C 

CATAATGGGG AAGGCCATCC AGCCTCGCGT CGGGGAGCTT TTTGCAAAAG CCTAGGCCTC 882 0 

CAAAAAAGCC TCCTCACTAC TTCTGGAATA GCTCAGAGGC CGAGGCGGCC TCGGCCTCTG 8 88 0 
CATAAATAAA AAAAATTAGT CAGCCA7G 8 90 8 
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